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r^ [ Abstract 

^ ' We show, for any positive integer k, that there exists a graph in which any equitable partition 

of its vertices into k parts has at least cfc^/log* k pairs of parts which are not e-regular, where 
c, e > are absolute constants. This bound is tight up to the constant c and addresses a question 
||0 I of Gowers on the number of irregular pairs in Szemeredi's regularity lemma. 

In order to gain some control over irregular pairs, another regularity lemma, known as the strong 

regularity lemma, was developed by Alon, Fischer, Krivelevich, and Szegedy. For this lemma, we 

prove a lower bound of wowzer-type, which is one level higher in the Ackermann hierarchy than 

v../ ' the tower function, on the number of parts in the strong regularity lemma, essentially matching the 

r^ . upper bound. On the other hand, for the induced graph removal lemma, the standard application 

j^ , of the strong regularity lemma, we find a different proof which yields a tower-type bound. 

We also discuss bounds on several related regularity lemmas, including the weak regularity 
lemma of Frieze and Kannan and the recently established regular approximation theorem. In 
particular, we show that a weak partition with approximation parameter e may require as many as 
^ ' 2^^^ ) parts. This is tight up to the implied constant and solves a problem studied by Lovasz and 

O^ . Szegedy. 
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1 Introduction 

Originally developed by Szemeredi as part of his proof of the celebrated Erdos-Turan conjecture on 
long arithmetic progressions in dense subsets of the integers [39], Szemeredi's regularity lemma |40| has 
become a central tool in extremal combinatorics. Roughly speaking, the lemma says that the vertex 
set of any graph may be partitioned into a small number of parts such that the bipartite subgraph 
between almost every pair of parts behaves in a random-like fashion. 

Given two subsets X and y of a graph G, we write d{X, Y) for the density of edges between X and Y. 
The pair (X, Y) is said to be (e, 6)-regular if for some a and all X' G X and Y' gY with \X'\ > S\X\ 
and \Y'\ > 5\Y\, we have a < d{X' , Y') < a + e. In the case where 6 = e, we say that the pair (X, Y) 
is e-regular. By saying that a pair of parts is random-like, we mean that they are (e, (5)-regular with e 
and 5 small, a property which is easily seen to be satisfied with high probability by a random bipartite 
graph. We will also ask that the different parts be of comparable size, that is, that the partition 
V{G) = Fi U . . . U Vfc be equitable, that is, ||Vi| — \Vj\\ < 1 for all i and j. 
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The regularity lemma now states that for each €,5,ri > 0, there is a positive integer M = M(e, 5, rf) 
such that the vertices of any graph G can be equitably partitioned V{G) = yi U . . . U Vm into M parts 
where ah but at most an rj fraction of the pairs {Vi^Vj) are (e, 5)-regular. We shah say that such a 
partition is (e, 5, ry)-regular and simply e-regular in the case e = 6 = rj. For more background on the 
regularity lemma, see the excellent surveys by Komlos and Simonovits [27] and Rodl and Schacht [33] . 

Use of the regularity lemma is now widespread throughout graph theory. However, one of the earliest 
applications, the triangle removal lemma of Ruzsa and Szemeredi [3H]) remains the standard example. 
It states that for any e > there exists 6 > such that any graph on n vertices with at most 6n^ 
triangles can be made triangle-free by removing en^ edges. It easily implies Roth's theorem [3l] on 3- 
term arithmetic progressions in dense sets of integers, and Solymosi [37] showed that it further implies 
the stronger corners theorem of Ajtai and Szemeredi [T], which states that any dense subset of the 
integer grid contains the vertices of an axis-aligned isosceles triangle. This result was extended to all 
graphs in |17l [3]. The extension, known as the graph removal lemma, says that given a graph H on h 
vertices and e > there exists 6 > such that any graph on n vertices with at most dn'^ copies of H 
can be made H-bee by removing ev? edges. 

One disadvantage of applying the regularity lemma to prove this theorem is the bounds that it gives 
for the size of 5 in terms of e. The proof of the regularity lemma yields a bound of tower-type for 
the number of pieces in the partition. When this is applied to graph removal, it gives a bound for 
6~^ which is a tower of twos of height polynomial in e~^. Surprisingly, any hope that a better bound 
for the regularity lemma might be found was put to rest by Gowers [22], who showed that there are 
graphs for which a tower-type number of parts are required in order to obtain a regular partition. 

To be more precise, the proof of the regularity lemma shows that M(e, 6, rj) can be taken to be a tower 
of twos of height proportional to e~'^5~'^rj~^ . Gowers' result, described in |13j as a tour de force, is a 
lower bound, with c = 1/16, on A/(l — 5'^, <^, 1 — 5^^) which is a tower of twos of height proportional to 
5~^. As Gowers notes, it is an easy exercise to translate lower bounds for small 5 and large e into lower 
bounds for large 5 and small e which are also of tower-type. However, the natural question, discussed 
by Szemeredi [lO], Komlos and Simonovits [27], and Gowers [22], of determining the dependency of 
M(e, 5, rf) on 77, which measures the fraction of allowed irregular pairs, has remained open. This is the 
first problem we will address here, showing that the dependence is again of tower-type. 

This does not mean that better bounds cannot be proved for the graph removal lemma. Recently, an 
alternative proof was found by the second author [TS], allowing one to show that 5~^ may be taken to 
be a tower of twos of height 0(loge~^), better than one could possibly do using regularity. Though 
this remains quite far from the lower bound of e~*^(^°g'^ ), it clears a significant hurdle. 

The second major theme of this paper is a proof of the induced graph removal lemma which similarly 
bypasses a natural obstacle. Let if be a graph on h vertices. The induced graph removal lemma, 
proved by Alon, Fischer, Krivelevich, and Szegedy [S], states that for any e > there exists 5 > Q such 
that any graph on n vertices with at most 5n^ induced copies of H may be made induced H-fiee by 
adding or deleting at most en^ edges. 

This result, which easily implies the graph removal lemma, does not readily follow from the same 



technique used to prove the graph removal lemma, mainly because of the possibility of irregular 
pairs in the regularity partition. To overcome this issue, Alon, Fischer, Krivelevich, and Szegedy [S] 
developed a strenghthening of Szemeredi's regularity lemma. Roughly, it gives an equitable partition 
A and an equitable refinement B oi A such that A and B are both regular, with the guaranteed 
regularity of B allowed to depend on the size of A, and the edge density between almost all pairs of 
parts in B close to the edge density between the pair of parts in A that they lie in. 

The proof of the strong regularity lemma involves iterative applications of Szemeredi's regularity 
lemma. This causes the upper bound on the number of parts in B to grow as a wowzer function, 
which is one level higher in the Ackermann hierarchy than the tower function. In order to get an 
improved bound for its various applications, one may hope that an improved bound of tower-type 
could be established. We show that no such bound exists. In fact, we will show that a seemingly 
weaker statement requires wowzer-type bounds. On the other hand, we give an alternative proof of 
the induced graph removal lemma, allowing one to show that S~^ may be taken to be a tower of twos 
of height polynomial in e~^, better than one could possibly achieve using the strong regularity lemma. 

We also make progress on determining bounds for various related regularity lemmas, including the 
Frieze-Kannan weak regularity lemma |19[ [20] and the regular approximation theorem, due indepen- 
dently to Lovasz and Szegedy [29] and to Rodl and Schacht [32]. We discuss all these contributions in 
more detail in the sections below. 

1.1 The number of irregular pairs 

The role of rj in the regularity lemma is to measure how many pairs of subsets in the partition are 
regular. If a partition into k pieces is (e, (5, r/)-regular, then there will be at most 77(2) irregular pairs 
in the partition. Szemeredi [IQ] wrote that it would be interesting to determine if the assertion of the 
regularity lemma holds when we do not allow any irregular pairs. This question remained unanswered 
for a long time until it was observed by Lovasz, Seymour, Trotter, and Alon, Duke, Lefmann, Rodl, 
and Yuster [3] that irregular pairs can be necessary. The simple example of the half-graph shows that 
this is indeed the case. The half-graph is a bipartite graph with vertex sets A = {ai, . . . ,0^} and 
B = {bi, . . . ,bn} in which (oj, bj) is an edge if and only if i < j. Any partition of this graph into M 
parts will have Q{M) irregular pairs. In other words, M{e,6,ri) must grow at least linearly in r/"^. 

However, the number of irregular pairs, or, in other words, the dependence of M(e, 6, rj) on r]~^ with 
e and 5 fixed, has not been well understood despite being asked several times, including by Komlos 
and Simonovits [27] and, more explicitly, by Gowers [22]. This problem and related problems have 
continued to attract interest (see, e.g., [26], [30]). The linear bound obtained from the half-graph 
appears to be the only bound in the literature for this problem. 

For fixed constants e and 6 and each M, we give a construction in which any partition into M parts 
has at least cM'^ / log* M irregular parts, where c > is an absolute constant, and this is tight apart 
from the constant c. The iterated logarithm log* n is the number of times the logarithm function 
needs to be applied to get a number which is at most 1. That is, log* a; = if a; < 1 and otherwise 
log* j; = 1 -|- log*(loga;) denotes the iterated logarithm. In other words, the dependence on rj in 



M(e, 6, rj) is indeed a tower of twos of height proportional to @{r] ^). 

Theorem 1.1 There are absolute constants c,e,6 > such that for every k there is a graph in which 
every equitable partition of the graph into k parts has at least ck"^/ log* k pairs of parts which are not 
(e, S) -regular. In other words, M{e, 6, rj) is at least a tower of twos of height cr]~^ . 

We prove Theorem 11.11 with e = i, 5 = 2"^"*^, and c = 2"'^*''^, and we make no attempt to optimize 
constants. The proof of Theorem 1 1 . 1 1 can be easily modified to obtain the same result with e tending 
to 1 at the expense of having 5 and c tending to 0. 

In the important special case where e = (5 = r/, we let M(e) = M(e, 5, rf). Gowers [22] gave two different 
constructions giving lower bounds on M(e). The first construction is simpler, but the lower bound 
it gives is a tower of twos of height only logarithmic in e~^. The second construction gives a lower 
bound which is a tower of twos of height e^^'^^, but is more complicated. Theorem 11.11 also gives a 
lower bound on M(e) which is a tower of twos of height polynomial in e~^, in fact linear in e~^, and 
the construction is a bit simpler. Unfortunately, the proof that it works, which builds upon Gowers' 
simpler first proof, is still rather complicated and delicate. 

We give a rough idea of how the graph G used to prove Theorem 1 1.1 1 is constructed. The graph G has a 
sequence of vertex partitions Pi, . . . , Pg, with Pj+i a refinement of Pj for 1 < i < s— 1, and the number 
of parts of Pj+i is roughly exponential in the number of parts of Pj. For each i,l<i<s — 1, we 
pick a random graph Gi with vertex set Pj, where each edge is picked independently with probability 
Pi- For every two vertex subsets X, y G Pj of G which are adjacent in Gj, we take random vertex 
partitions X = Xy U Xy and Y = Y^ U Y^ into parts of equal size, with each of these parts the union 
of parts of Pj+i. Then, for d = 1,2, we add the edges to G between Xy and Y^. We will show that 
with positive probability the graph G constructed above has the desired properties for Theorem ll.il 
In fact, in Theorem 13. 1|, we will show that it has the stronger property that any (e, 6, r/)-regular vertex 
partition of G is close to being a refinement of Pg. 

A novelty of our construction, not present in the constructions of Gowers, is the use of the random 
graphs Gi, which allow us to control the number of irregular pairs. Instead, for every pair of parts X, Y 
in Pj, Gowers |22j introduces or deletes some edges between them so as to make the pair of parts far 
from regular. To prove the desired result, we first establish several lemmas on the edge distribution in 
G. The construction is general enough and Theorem 13. II strong enough that we also use it to establish 
a wowzer-type lower bound for the strong regularity lemma, as described in the next subsection. 

1.2 The strong regularity lemma 

Before stating the strong regularity lemma, we next define a notion of closeness between an equitable 
partition and an equitable refinement of this partition. For an equitable partition A = {^|1 < i < k} 
of V{G) and an equitable refinement B = {Vij\l < i < k,l < j < £} oi A, we say that B is e-close 
to A if the following is satisfied. All 1 < i < i' < k but at most e/c^ of them are such that for all 
1 < j,j' < ^ but at most ei'^ of them \d{Vi,Vii) — d(yij,Viiji)\ < e holds. This notion roughly says 



that B is an approximation of A. We are now ready to state the strong regularity lemma of Alon, 
Fischer, Krivelevich, and Szegedy [S]. 

Lemma 1.1 (Strong regularity lemma) For every function / : N — t- (0, 1) there exists a number 
S = S{f) with the following property. For every graph G = {V,E), there is an equitable partition 
A of the vertex set V and an equitable refinement B of A with \B\ < S such that the partition A is 
f{l)-regular, the partition B is f{\A\) -regular, and B is f{l)-close to A. 

The upper bound on S, the number of parts of B, that the proof gives is of wowzer-type, which is 
one level higher in the Ackermann hierarchy than the tower function. The tower function is defined 
by T(l) = 2, and T{n) = 2 '""^^ for n > 2. The wowzer function W{n) is defined by Vl^(l) = 2 
and W{n) = T(W{n — 1)). For reasonable choices of the function /, which is the case for all known 
applications, such as those for which 1// is an increasing function which is at least a constant number 
of iterations of the logarithm function, the upper bound on S{f) is at least wowzer in a power of 
e = /(I). Recall that M(e), the number of parts required for Szemeredi's regularity lemma, grows 
as a tower of height a power of e~^. The precise upper bound on the number of parts in the strong 
regularity lemma is defined as follows. Let Wi = M(e) and Wi+i = M{2f(Wi)/W^). The proof of 
the strong regularity lemma [S] shows that S{f) = 512e~^Wj- with j = 64e~^ satisfies the required 
property. 

For a partition V : V{G) = Vi U . . . U 14 of the vertex set of a graph G, the mean square density of V 
is defined by 

q{V) = J2dHV,,Vj)p,pj, 

where pi = \Vi\/\V{G)\. This function plays an important role in the proof of Szemeredi's regularity 
lemma and its variants. 

The strong regularity lemma gives a regular partition A, and a refinement B which is much more 
regular and is close to A. For equitable partitions A and B with B a refinement of A, the condition B 
is e-close to A is equivalent, up to a polynomial change in e, to q{B) < q{A) + e. Indeed, if B is e-close 
to A, then q{B) < q{A) + 0(e), while if q{B) < q{A) + e, then B is 0(e^/^)-close to A. A version of 
this statement is present in Lemma 3.7 of [5]. As it is suffiicent and more convenient to work with 
mean square density instead of e-closeness, we do so from now on. 

Note that in the strong regularity lemma, without loss of generality we may assume / is a (mono- 
tonically) decreasing function. Indeed, this can be shown by considering the decreasing function 
f'{k) := mini<j<fc f{k). From the above discussion, it is easy to see that the strong regularity lemma 
has the following simple corollary, with a similar upper bound. 

Corollary 1.1 Let e > and / : N — t- (0, 1) be a decreasing function. Then there exists a number 
S = S{f, e) such that for every graph G there are equitable partitions A, B of the vertex set of G with 
\B\ < S, q{B) < q{A) + e, and B is f{\A\) -regular. 

We prove a lower bound for the strong regularity lemma of wowzer-type, which essentially matches the 
upper bound. Maybe surprisingly, our construction further shows that much less than what is required 



from the strong regularity lemma already gives wowzer-type bounds. In particular, even for Corollary 
11.11 which appears considerably weaker than the strong regularity lemma, we get a wowzer-type lower 
bound. Note that in Corollarv ll.il B is not required to be a refinement of A. In this case we could 
have q{B) being close to q{A) but the edge densities between the parts in these partitions are quite 
different from each other, i.e., these partitions are not close to each other. 

Theorem 1.2 Let < e < 2~^°° and / : N -^ (0, 1) be a decreasing function with /(I) < 2~^°°e^. 
Define Wi recursively by Wi = 1, M^^+i = T (2^'^^e^ / fiW^j), where T is the tower function. Let 
W = Wt-i with t = 2~'^^e~^. Then there is a graph G such that if equitable partitions A,B of the 
vertex set of G satisfy q{B) < q{A) + e and B is f{\A\) -regular, then \A\, \B\ > W. 

We have the following corollary (by replacing e by e^'^), which is a simple to state lower bound of 
wowzer-type. 

Corollary 1.2 For < e < 2"'^"'^, there is a graph G such that if equitable partitions A,B of the 
vertex set of G satisfy \B\ > \A\, q{B) < q{A) + e and B is e/\A\-regular, then |;B|,|^| are bounded 
below by a function which is wowzer in Q{e~~^''^). 

1.3 Induced graph removal 

Let H he a fixed graph on h vertices and let G be a graph with o{n^) copies of H. To prove the graph 
removal lemma, we need to prove that all copies of H can be removed from G by deleting o(n^) edges. 
The standard approach is to apply the regularity lemma to the graph G to obtain an e-regular vertex 
partition (with an appropriate e) into a constant number of parts M(e). Then delete edges between 
pairs of parts {Vi,Vj), including i = j, if the pair is not e-regular or the density between the pair is 
small. It is easy to see that there are few deleted edges. Furthermore, if there is a copy of H in the 
remaining subgraph, then the edges go between pairs of parts which are e-regular and not of small 
density. A counting lemma then shows that in such a case the number of copies of H is Q,{n ) in the 
remaining subgraph, and hence in G as well. But this would contradict the assumption that G has 
o{n^) copies of H, so all copies of H must already have been removed. 

Recall that the induced graph removal lemma [S] is the analogous statement for induced subgraphs, 
and it is stronger than the graph removal lemma. It states that for any graph H on h vertices and 
e > there is 5 = 5{e, H) > such that if a graph G on n vertices has at most 6n induced copies of 
H, then we can add or delete en^ edges of G to obtain an induced H-free graph. 

One well-known application of the induced graph removal lemma is in property testing. This is an 
active area of computer science where one wishes to quickly distinguish between objects that satisfy 
a property from objects that are far from satisfying that property. The study of this notion was 
initiated by Rubinfield and Sudan [35], and subsequently Goldreich, Goldwasser, and Ron [21] started 
the investigation of property testers for combinatorial objects. One simple consequence of the induced 
graph removal lemma is a constant time algorithm for induced subgraph testing with one-sided error 
(see [2] and its references). A graph on n vertices is e-far from being induced H-free if at least en^ 



edges need to be added or removed to make it induced H-hee. The induced graph removal lemma 
implies that there is an algorithm which runs in time Oe(l) which accepts all induced H-free graphs, 
and rejects any graph which is e-far from being induced H-hee with probability at least 2/3. The 
algorithm samples t = 26~^ /i-tuples of vertices uniformly at random, where 6 is picked according 
to the induced graph removal lemma, and accepts if none of them form an induced copy of H, and 
otherwise rejects. Any induced H-free graph is clearly accepted. If a graph is e-far from being induced 
H-iree, then it contains at least 6n copies of H, and the probability that none of the sampled /i-tuples 
forms an induced copy of H is at most (1 — 5)* < 1/3. Notice that the running time as a function of e 
depends on the bound in the induced graph removal lemma, and the proof using the strong regularity 
lemma gives a wowzer-type dependence. 

It is tempting to try the same approach using Szemeredi's regularity lemma to obtain the induced 
graph removal lemma. However, there is a significant problem with this approach, which is handling 
the pairs between irregular pairs. To get around this issue, Alon, Fischer, Krivelevich, and Szegedy 
[5] developed the strong regularity lemma. 

Because of its applications, including those in graph property testing, it has remained an intriguing 
problem to improve the bound in the induced graph removal lemma. This problem has been discussed 
in several papers by Alon and his collaborators [2], [6], [8]. The main result discussed in this subsection 
addresses this problem, improving the bound on the number of parts in the induced graph removal 
lemma from wowzer-type to tower-type. The tower function ti{x) is defined by tQ{x) = x and tj+i(x) = 
2*H^). We say that ij(x) is a tower in x of height i. 

Theorem 1.3 For any graph H on h vertices and < e < 1/2 there is 6 > with 5^^ a tower in h 
of height polynomial in e'^^ such that if a graph G on n vertices has at most 6n^ induced copies of H, 
then we can add or delete en^ edges of G to obtain an induced H-free graph. 

The following lemma is an easy corollary of the strong regularity lemma which was used in [S] to 
establish the induced graph removal lemma. 

Lemma 1.2 For each < e < 1/3 and decreasing function / : N — t- (0, 1/3) there is 5' = (5'(e, /) such 
that every graph G = {V,E) with \V\ > 6'~^ has an equitable partition V = ViU . . .UVk and vertex 
subsets Wi C Vi such that \Wi\ > S'\V\, each pair {Wi,Wj) with l<i<j<kis f{k)-regular, and all 
but at most ek'^ pairs 1 < i < j < k satisfy \d{Vi, Vj) — d{Wi, Wj)\ < e. 

In fact. Lemma 11.21 is a little bit stronger than the original version in [5] in that each set Wi is 
/(fc)-regular with itself. The original version follows from the strong regularity lemma by taking the 
partition y = Vi U . . . U Vfc to be the partiton A in the strong regularity lemma, and the subset Wi to 
be a random part Vij C Vi oi the refinement ;B of ^ in the strong regularity lemma. 

From this slightly stronger version, the proof of the induced graph removal lemma is a bit simpler and 
shorter. Indeed, with f{k) = -^e^, which does not depend on k, if there is a mapping (j) : V{H) — )■ 
{1,. . . ,k} such that for all adjacent vertices v,w of H, the edge density between VF^(„) and W^(^„) 
is at least e, and for all distinct nonadjacent vertices v,w of H, the edge density between Wm^-j and 



Wmw) is at most 1 — e, then a standard counting lemma shows that G contains at least 6n^ induced 
copies of H, where 6 = {e/A)y2)S'f^, Hence, we may assume that there is no such mapping (p. We then 
delete edges between Vi and Vj if the edge density between Wi and Wj is less than e, and one adds 
the edges between Vi and Vj if the density between Wi and Wj is more than 1 — e. The total number 
of edges added or removed is at most Seji^, and no induced copy of H remains. Replacing e by e/8 in 
the above argument gives the induced graph removal lemma. 

We find another proof of Lemma 11.21 with a better tower-type bound. This in turn implies, by 
the argument sketched above, the tower-type bound for the induced graph removal lemma stated in 
Theorem 11.31 

The starting point for our approach to Lemma 11.21 is a weak regularity lemma due to Duke, Lefmann 
and Rodl J15j . This lemma says that for a /c-partite graph, between sets Vi, V2, . . . , 14, there is an 
e-regular partition of the cylinder Vi x ■ ■ ■ x Vk into a relatively small number of cylinders K = 
Wi X ■ ■ ■ X Wk, with Wi C Vi for 1 < i < k. The definition of an e-regular partition here is that all 
but an e-fraction of the /c-tuples (f 1 , . . . , Vk) G Vix ■ ■ ■ xV^ are in e-regular cylinders, where a cylinder 
Wi X ■ ■ ■ X Wk is e-regular if all (2) pairs {Wi, Wj), I < i < j < k, are e-regular in the usual sense. 

In the same way that one derives the strong regularity lemma from the ordinary regularity lemma, 
we show how to derive a strong version of this lemma. We will refer to this strengthening, of which 
Lemma ll. 21 is a straightforward consequence, as the strong cylinder regularity lemma. It will also be 
convenient if, in this lemma, we make the requirement that a cylinder be regular slightly stronger, by 
asking that each Wi be regular with itself. That is, we say that a cylinder Wi x ■ ■ ■ x Wk is strongly 
e-regular if all pairs {Wi, Wj) with I < i,j < k are e-regular. A partition /C of the cylinder Vix ■ ■ ■ xVk 
into cylinders K = Wi x ■ ■ ■ x Wk, with Wi C Vi for 1 < i < k, is then said to strongly e-regular if all 
but an e-fraction of the fc-tuples {vi, . . . , Vk) £ Vi x ■ ■ ■ x Vk are in strongly e-regular cylinders. 

Let P : y = yiU---UVfcbea partition of the vertex set of a graph and /C be a partition of the 
cylinder Vi x ■ ■ ■ xVk into cylinders. For each K = Wi x ■ ■ ■ x Wk, with Wi C Vi for 1 < i < A;, we let 
Vi{K) = Wi. We then define the partition Q(/C) of V to be the refinement of P which is the common 
refinement of all the parts Vi{K) with i G [A;] and K £ IC. The strong cylinder regularity lemma is 
now as follows. 

Lemma 1.3 For < e < 1/3, positive integer s, and decreasing function / : N — )• (0, e], there is S = 
S{e, s, f) such that the following holds. For every graph G, there is an integer s < k < S, an equitable 
partition P : V = ViU . . . UVk and a strongly f{k) -regular partition K, of the cylinder Vi x ■ ■ ■ x Vk 
into cylinders satisfying that the partition Q = Q{IC) of V has at most S parts and q{Q) < q{P) + e. 
Furthermore, there is an absolute constant c such that letting si = s and Sj+i = t4:{{si/ f{si)Y), we 
may take S = si with £ = 2e~^ + 1. 

In order to prove this lemma, we need, in addition to the Duke-Lefmann-Rodl regularity lemma, a 
lemma showing that for each e > there is 5 > such that every graph G = {V, E) contains a vertex 
subset U with \U\ > 6\V\ which is e-regular with itself, where, crucially, 5~^ is bounded above by a 
tower function of e~^ of absolute constant height. While seemingly standard, we do not know of such 
a result in the literature. 



Lemma 11.21 follows from Lemma 11.31 by considering a random cylinder K in the cylinder partition /C, 
with each cylinder picked with probability proportional to its size, and letting Wi = Vi{K). 

1.4 Prieze-Kannan weak regularity lemma 

Frieze and Kannan [19], [20] developed a weaker notion of regularity which is sufficient for certain 
applications and for which the dependence on the approximation e is much better. It states the 
existence of a vertex partition into a small number of parts for which the number of edges across any 
two vertex subsets is within en? of what is expected based on the edge densities between the parts of 
the partition and the intersection sizes of the vertex subsets with these parts. 

Lemma 1.4 (Frieze-Kannan ■weak regularity lemma) For each e > there is a positive integer 
k{e) such that every graph G = {V, E) has an equitable vertex partition y = Vi U . . . U I4. with k < k{e) 
satisfying that for all subsets A,B^V, we have 

\e{A,B)- Y, d{Vi,Vj)\AnVi\\BnVj\\<e\V\\ 

l<i,j<k 

The weak regularity lemma has a number of algorithmic applications. Frieze and Kannan [20] used the 
weak regularity lemma to give constant-time approximation algorithms for some general problems in 
dense graphs, a special case being the Max-Cut of a graph. Recently, Bansal and Williams [12] used the 
weak regularity lemma to obtain a faster combinatorial algorithm for Boolean matrix multiplication. 
The importance of the weak regularity lemma is further discussed in the citation of the recent Knuth 
Prize to Kannan. 

As there are several applications of the weak regularity lemma to fundamental algorithmic problems, 
we would like to know the correct bounds on the number of parts for the weak regularity lemma. The 
proof of the weak regularity lemma [20] shows that we may take k{e) = 2^*^ K If this upper bound 
could be improved, it would lead to faster algorithms for several problems of interest. Lovasz and 
Szegedy [29] studied the problem of estimating the minimum number of parts fc(e) required for the 
weak regularity lemma, proving a lower bound on k{e) of the form 2^^^ ' . Here we close the gap by 
proving a new lower bound which matches the upper bound. 

Theorem 1.4 For each e > 0, there are graphs for which the minimum number of parts in a weak 
regular partition with approximation e is 2^^'^ > . 

A careful analysis of the proof of Theorem 11.41 shows that the number of parts required in the weak 
regularity lemma with approximation e is at least 2~^ ^ for < e < 2~^^. In fact, the theorem 
yields a stronger result, since we do not here require that the partition be equitable. 

While the number of parts in the weak regularity lemma is 2^*^ ', the proof obtains the partition 
as an overlay of only 0(e~^) sets. As discussed in [29], in some applications, such as in @], this can 
be treated as if there were only about 0(e~^) classes, which makes the weak regularity lemma quite 
efficient. It was shown in [3], and is also implied by Theorem \1A\ that the partition cannot be the 
overlay of fewer sets. 



1.5 The regular approximation lemma 

Another strengthening of Szemeredi's regularity lemma came from the study of graph limits by Lovasz 
and Szegedy [29] , and also from work on the hypergraph generalization of the regularity lemma by Rodl 
and Schacht [32]. This regularity lemma, known as the regular approximation lemma [33], provides 
an arbitrary precision for the regularity as a function of the number of parts of the partition if an 
e-fraction of the edges are allowed to be added or removed. 

For a function 51 : N — )■ (0, 1), a partition of the vertex set into k parts is g-regular if all pairs of distinct 
parts in the partition are g{k)-Teg\i\ax. 

Lemma 1.5 (Tlegular approximation lemmaj For every e > 0, positive integer s and decreasing 
function g : N — )■ (0, 1), there is an integer T = T(g, e, s) so that given a graph G with n vertices, one 
can add-to/remove-from G at most ev? edges and thus get a graph G' that has a g-regular equitable 
partition of order k for some s < k < T. 

Lovasz and Szegedy [2U] state that the regular approximation lemma is equivalent to the strong 
regularity lemma, Lemma 11.11 It is not difficult to deduce Lemma 11.51 from the strong regularity 
lemma, see [9] or [33] for details. Unlike the original graph limit approach, this proof of the regular 
approximation lemma gives explicit bounds and yields a polynomial time algorithm for finding the 
partition and the necessary edge modifications. In the other direction, by applying Lemma 11.51 with 
1/g a tower in the 1// from Lemma ll.H letting A be the (7-regular partition of G' , and then using 
Szemeredi's regularity lemma to get a refinement B oi A which is an /(^)-regular partition of G, it is 
easy to deduce the strong regularity lemma. 

The major caveat here is the additional use of Szemeredi's regularity lemma in deducing the strong 
regularity lemma from the regular approximation lemma. Due to the additional use of Szemeredi's 
regularity lemma, it does not rule out the possibility that the wowzer-type upper bound on T in the 
regular approximation lemma can be improved to tower-type. Maybe surprisingly, we indeed make 
such an improvement. 



Theorem 1.5 For e > 0, positive integer s and a decreasing function (^ : N — )• (0, 1), let 6{t) 

32P"' 



min(^^,e/2). Let ti = s and for i > 1 let tj+i = tik{6{ti)), where k is as in the weak regularity 



lemma, so k{a) = 2 '° >. Let Tq = tj with j = 4e~^. Then the regular approximation lemma holds 
with T = 16To/(5(To)^. In other words, the regular approximation lemma holds with a tower-type 
bound. 

It is usually the case that l/g{t) in the regular approximation lemma is at most a tower of constant 
height in e~^ and t, and in this case the upper bound T on the number of parts is only a tower of 
height polynomial in e~^. Only in the unusual case of 1/g being of tower-type growth does the number 
of parts needed in the regular approximation lemma grow as wowzer-type. 

Alon, Shapira, and Stav [S] give a proof of the regular approximation lemma which yields a polynomial 
time algorithm for finding the partition and the necessary edge modifications. Similarly, our new proof 
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can be made algorithmic with a polynomial time algorithm for finding the partition and the necessary 
edge modifications. Making the proof algorithmic is essentially the same as done in [S], so we do not 
include the details. 

A partition of a graph satisfying the weak regularity lemma, Lemma II. 4| is called a weak e-regular 
partition. Tao showed [U] (see also [33]), by iterating the weak regularity lemma, that one obtains the 
following regularity lemma which easily implies Szemeredi's regularity lemma with the usual tower- 
type bounds. 

Lemma 1.6 For all e > 0, positive integers s and functions J : N — )• (0,1), there is a Tq such that 
every graph has an equitable vertex partition P into t > s parts which is weak e-regular, an equitable 
vertex refinement Q into at most Tq parts which is weak 6{t)-regular, and q{Q) < q{P) + e. 

Let ti = s, and for i > 1, let tj+i = tik{S{ti)), where k is as in the weak regularity lemma. Recall A;(e) 
is exponential in e~^. Then Tq in Lemma 11.61 is given by Tq = tj with j = e~^. In particular, if 6~^ is 
bounded above by a tower of constant height, then Tq in Tao's regularity lemma grows as a tower of 
height linear in e~^. 

Our proof of Theorem [T3] shows that the regular approximation lemma is equivalent to Tao's regularity 
lemma with similar bounds. In fact, we show that T in the regular approximation lemma can be taken 
to be T = 16Tq/6{Tq)'^, where Tq = Tq{6, cq, s) is the bound on the number of parts in Tao's regularity 
lemma, 6{t) = min(^^,e/2), and eq = (e/2)^. As Tao's regularity lemma is a simple consequence of 
the regular approximation lemma and an application of the weak regularity lemma, it suffices to show 
how to deduce the regular approximation lemma from Tao's regularity lemma. 

The proof starts by applying Tao's regularity lemma with 6 and eo as above. For each pair {X, Y) of 
parts in Q, where X C A and Y C B with A,B parts of P, we randomly add/delete edges between 
X, Y with a certain probability so that the density between X and Y is about the same as the density 
between A and B. We show that in doing this we have made every pair {A,B) of parts of P g{t)- 
regular with t = \P\. Since q{Q) < q(-P) + eO) the edge density d{X,Y) between most pairs {X,Y) of 
parts of Q is close to the edge density d{A, B) between A and B, and few edges are changed to obtain 
a graph G' for which the partition P is (^-regular. 

We next briefly discuss lower bounds for the regular approximation lemma. In the case g is a, (small) 
constant function, a tower-type lower bound follows from Theorem ll.il \i g is at least a tower function, 
we get a lower bound of wowzer-type from Theorem 11.21 and the fact that the strong regularity 
lemma follows from the regularity approximation lemma with an additional application of Szemeredi's 
regularity lemma as discussed earlier. One could likely come up with a construction giving a general 
lower bound essentially matching Theorem ll.51 but as the already mentioned interesting cases discussed 
above are handled by Theorems 11.11 and II. 2| we do not include such a construction. 

Organization 

In the next section, we prove some useful tools for establishing lower bounds for Szemeredi's regularity 
lemma and the strong regularity lemma. In Section [31 we give a general construction and use it to 
prove Theorem 11.11 which addresses questions of Szemeredi and Gowers on the number of irregular 
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pairs in Szemeredi's regularity lemma. In Section [U we use the general construction to prove Theorem 
11.21 which gives a wowzer-type lower bound on the number of parts of the two partitions in the strong 
regularity lemma. In Section [5l we prove the strong cylinder regularity lemma and use it to prove a 
tower-type upper bound on the induced graph removal lemma. In Section [6l we prove a tower-type 
upper bound on the number of parts in the regular approximation lemma. In Section [71 we prove a 
tight lower bound on the number of parts in the weak regularity lemma. These later sections, Sections 
O [6] and [71 are largely independent of earlier sections and of each other. The interested reader may 
therefore skip forward without fear of losing the thread. 

We finish with some concluding remarks. This includes a discussion showing that in the regularity 
lemma, the condition that the parts in the partition are of equal size does not affect the bounds by 
much. We also discuss an early version of Szemeredi's regularity lemma, and a recent result of Malliaris 
and Shelah which shows an interesting connection between irregular pairs in the regularity lemma and 
the appearance of half-graphs. 

Throughout the paper, we systematically omit floor and ceiling signs whenever they are not crucial 
for the sake of clarity of presentation. We also do not make any serious attempt to optimize absolute 
constants in our statements and proofs. 

2 Tools 

Suppose S = Si + ■ ■ ■ + Sn is the sum of n mutually independent random variables, where for each i, 
Pr[5j = 1] = p and Pr[S'j = 0] = 1 — p. The sum S has a binomial distribution with parameters p and 
n, and has expected value pn. A Chernoff-type estimate (see Theorem A. 1.4 in [10]) implies that for 
a > 0, 

Pt[S -pn>a]< 6-2-^'/" (1) 

By symmetry, we also have Pt[S — pn < —a] < e~'^°' '" and hence Pr[|5 — pn\ > a] < 2e~'^"' '". 

We start by proving a couple of lemmas on the edge distribution of random bipartite graphs with 
different part sizes. Consider the random bipartite graph B = B{m,M) with parts [m] and [M] 
formed by each vertex i G [m] having exactly M/2 neighbors (we assume M is even) in [M] picked 
uniformly at random and independently of the choices of the neighborhoods for the other vertices in 

[m]. 

The following lemma shows that, with high probability, certain simple estimates on the number of 
common neighbors or nonneighbors of any two vertices in B{m,M) hold. 

Lemma 2.1 Let M > m he positive integers with M > 2^*^ even, and < p. < 1/2 be such that 
m > 2/i~^logM. Then, with probability at least 1 — M~^, the random bipartite graph B = B{m,M) 
has the following properties: 

• for any distinct j,j' £ [M], the number of i for which j and j' are either both neighbors of i or 
both nonneighbors of i is less than (^ + fi)m. 
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• for any distinct i,i' G [m], the number of common neighbors of i and i' and the number of 
common nonneighbors of i and i' in [M] are both less than Q + M~^'^) M . 

Proof: Fix distinct j,j' G [M]. For each i G [m], the probabihty that i is adjacent to both j,j' or 
nonadjacent to both j, j' is {^■ — 1)/(M — 1) < i, and these events are independent of each other. 
Therefore, by ([TJ, the probabihty that the number of i for which j and j' are either both neighbors of 
i or both nonneighbors of i is at least {^ + /i)?7i is at most e"^^^™^ '*" = e"^'^ "* < M~ . As there are 
( 2 ) choices for j, j', and \M~'^ > M~^( 2 )> by the union bound we have that B has the first desired 
property with probabihty at least 1 — |M~^. 

As the hyper geometric distribution is at least as concentrated as the corresponding binomial distri- 
bution (for a proof, see Section 6 of [25]), we can apply ([1]) to conclude that for each fixed pair 
i, i' G [m] of distinct vertices the probability that the number of common neighbors of i and i' is at 
least (l + M-^/^) M is at most e-'^i^'^'^'Mf/M ^ e"2^^'^^ Similarly, for each fixed pair i,i' G [m] 
of distinct vertices the probability that the number of common nonneighbors of i and i' in [M] is at 
least (i + M-V^) M is at most g-^^'^". 

As there are C^) choices for i,z' and 2-^"^ — 2e~^ (™), by the union bound we have that B 
has the second desired property with probability at least 1 — |M~^. Hence, with probability at least 
1 — M~^, B has both desired properties, which completes the proof. □ 

The next lemma shows that the edges in B{m,M) are almost surely uniformly distributed between 
large vertex subsets. 

Lemma 2.2 Let M and m be positive integers with M even. With probability at least 1 — M~^ , for 
any Ui C [m] and U2 C [M] with \Ui\ = ui and \U2\ = U2, we have 

\eB{Ui,U2)-\uiU2\<^ff^ (2) 

where 

(CTfl cAd 
ni In h U2 In 
■ui -U2 

Proof: For fixed subsets Ui C [m] and U2 C [M], the random variable eB{Ui,U2), which has 
mean gl^^illf^al) despite not satisfying a binomial distribution, still satisfies the estimate ([1]) for the 
corresponding binomial distribution with parameters 1/2 and |f/i||L^2|- Indeed, note that eB{Ui,U2) 
is the sum of the degrees of the vertices of Ui in U2, and these \Ui\ degrees are identical independent 
random variables, each satisfying a hypergeometric distribution. By Theorem 4 in Section 6 of |25| . 
the expected value of the exponential of a random variable with a hypergeometric distribution is at 
most the expected value of the exponential of the random variable with the corresponding binomial 
distribution. Substituting this estimate into the proof of ([T|) shows that the Chernofi^ estimate also 
holds for eB{Ui,U2)- Hence, the probability ([2]) doesn't hold for a particular pair Ui,U2 is less than 
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2e 2//(ui«2)^ gy ii^Q union bound, the probability that there is a pair of subsets Ui C [m] and U2 C [M] 
not satisfying (I5|) is at most 

V U2 



E eOO— . E E(-) 



ni=l ^2=1 



n 



From the bipartite graph B, we construct equitable partitions {Ai, -Bj)™ ^ of [M], by letting Aj denote 
the set of neighbors of vertex i G [m] in graph B. From Lemmas 12.11 and 12.21 we have the following 
corollary. 

Corollary 2.1 Suppose M > m are positive integers with M > 2'^^ even, and < fi < 1/2 is such 
that m > 2;U~^logAf. There is a bipartite graph B with parts [m] and [M], with each vertex in [m] of 
degree M/2 with the following properties. The estimate ^ holds for all Ui C [m] and U2 C [M] with 
\Ui\ = ui and IC/2I = ^2; o.iT'd B satisfies the two properties in the conclusion of Lemma \2.1\ 

The next lemma is a useful consequence of the equitable partitions (^i,-Bj)^^ behaving randomly. 
Given a vector A G M*^ and 1 < g < 00, write ||A||g for yZ^^-i lAjj'^j and ||A||oo for maxi<j<M |Aj|. 

Lemma 2.3 Let M he a positive even integer, < fi < 1/2, and {Ai,Bi)"^-^ be a sequence of partitions 
satisfying the conclusion of Corollary \2.1\ Suppose that < o", r, a are such that cj, r < 1, a < 1/2, 
and 

(^-/i)(l-a2)>^ + 2(l-T)a(l-a). 

Then for every sequence A = (Ai,...,Aa/) of nonnegative real numbers which are not all zero with 
IIAII2 = (t||A||i, there are at least rm values of i for which min(aj,6j) > a||A||i, where Oj = X^,g^. Aj 
and bi = J2jeB, ^j- 

Proof: Note that by multiplying each Xj by l/||A||i, we may assume without loss of generality that 
||A||i = 1. For distinct j,j' E [M], let {j,j')i denote that j and j' lie in different sets in the partition 
{Ai, Bi). Since for any distinct j,j' G [M], the number of i for which {j,j')i holds is at least (2 — A*)?^, 
we have 

J2 ^i V > (^ - M)rn5] A,(l - A,) = (i - //)m(||A||i - IIAIli) = (^ - /.)m(l - a'), (3) 

where the sum is over all ordered triples {j,j',i) with j,j' distinct and j and j' lie in different sets in 
the partition {Ai,Bi). We have the identity 

Yl ^jV = '^Yaibi. 
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Since Qj + 6j = 1, we have Oj^j < 1/4 and if min(aj, 6j) < a, then aj6j < a(l — a). So if min(aj, 6,/) < a 
for all but less than Tm values of i, then 

y AjAj' < — TTi + 2(1 — T)ma(l — a). 
Ud'h 
Comparing with ([3|) and dividing by m, this contradicts the supposition, and completes the proof. □ 

As usual, G{n,p) denotes the random graph on n vertices chosen by picking each pair of vertices as 
an edge randomly and independently with probability p. We finish this section with a few standard 
lemmas on the edge distribution in G{n,p). 

Lemma 2.4 In G{n,p), with probability at least 1 — n""^ , every pair of disjoint vertex subsets Ui and 
U2 satisfy 

\eiUi,U2) -puiU2\ < VS; (4) 

where ui = \Ui\, U2 = \U2\ and, for ui < U2, g = g{ui,U2) = 2u\U2 In ^. 

Proof: For fixed sets \J\ and C/2) the quantity e{Ui,U2) is a binomial distributed random variable 
with parameters U1U2 and p. By ([1]), we have that the probability dH does not hold is less than 
2e-'^9/{uiu2j^ By the union bound, the probability that there are disjoint sets Ui and U2 for which i^ 
does not hold is at most 

U2 = lui=l ^ ^ ^ ^ U2 = lUl = l ^ ^ ^ ^ 

n U2 / X ~2u2 

U2=lUl = l ^ ' 

The result follows. □ 

Lemma 2.5 In G{n,p), with probability at least 1 — n~'^ , every vertex subset U satisfies 

\e{U)-p(^^\<^, (5) 

where u = \U\ and g = g{u) = ^u^ In ■^. 

Proof: For fixed U , the quantity e{U) is a binomially distributed random variable with parameters 
(2) and p. By ([T|), we have that the probability (0) does not hold is less than 2e~ ^' V2J. By the union 
bound, the probability that there is a vertex subset U for which ([5]) does not hold is at most 
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Combining the estimates from the previous two lemmas, we can bound the probability in G{n,p) that 
there are two not necessarily disjoint subsets with large edge discrepancy between them. 
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Lemma 2.6 In G{n,p), the probability that there are integers ui and U2 with ui < U2 and not neces- 
sarily disjoint vertex subsets Ui and U2 with \Ui\ = ui and \U2\ = U2 such that 

|e(C/i,C/2) - puiU2\ > 5a//i, (6) 

where h = h{ui,U2) = uitt|ln^, is at most 2n^^. 

Proof: For sets Ui and U2, letting U[ = Ui\U2, U2 = U2\Ui, and U = UinU2,we have 

eiUuU2) = e{U[,U2) + 2e{U)+e{U,U^). 

We have that the bounds in Lemmas 12.41 and 12. 51 hold with probability at least 1 — 2n~^. Hence, using 
the triangle inequality, and \U[\ < ui, \U\ < ui, \U2\ < U2, we have 

|e(^i,C/2) - puiU2\ < 2^/g{ui,U2) + 2^/g{ui) + pui < bVh 

with probability at least 1 — 2n~^. Here the extra pui factor comes from the fact that degenerate 
edges are not counted in e{U). □ 



3 A general graph construction 

In this section, we will define a nonuniform random graph G = {V, E) which, assuming certain es- 
timates, has the property that any sufficiently regular partition of its vertex set is close to being a 
refinement of a particular partition of G into many parts. As this particular partition has many parts, 
this will imply that any sufficiently regular partition will have many parts. After defining G, we will 
prove that certain useful estimates on the edge distribution of G hold with positive probability. We 
will use these estimates to show that G has the desired property. 

3.1 Defining graph G 

Following Gowers |22j . we attempt to reverse engineer the proof of Szemeredi's regularity lemma to 
show that the upper bound is essentially best possible. The proof of the regularity lemma follows 
a sequence of refinements of the vertex set of the graph until we arrive at a regular partition, with 
the number of parts in each partition exponentially larger than in the previous partition. We build a 
sequence of partitions of the vertex set, and then describe how the edges of G are distributed between 
the various parts of the partition. To show that any (sufficiently) regular partition Z of V{G) requires 
many parts, we show that Z is roughly a refinement of the partitions we constructed in defining G. 

Let mi > 2"^^^ be a positive integer and p = 2"^*^. For 2 < z < s, let mi = ?7ij_ia.j_i, where 

I 9/10 1 —1/10 

aj_i = 2L'""»-i J . Suppose Pi > m^ for 1 < i < s — 1. 

The vertex set V has a sequence of equitable partitions Pi, . . . ,Ps, where Pj is a refinement of Pi for 
j > i defined as follows. The number of parts of Pi is rrii. For each set X in partition Pj, we pick 
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an equitable partition of X into a^ parts, and let Pj+i be the partition of V with ttt-j+i = mjOj parts 
consisting of the union of these partitions of parts of Pi. 

For 1 < i < s — 1, let Gj be a uniform random graph on Pi with edge probability pi. That is, 
the vertices of the graph are the rui pieces of the partition and we place edges independently with 
probability pi. In practice, we will make certain specific assumptions about the edge distribution of 
Gi but these will hold with high probability in a random graph. For example, we shall assume that 
every vertex in Gi has degree at least pimi/2. 

For each X,Y £ Pi with {X,Y) an edge of Gi, we have an equitable partition QxY '■ X = Xy U Xy 
into two parts, where Xy is a union of some of the parts in Pj+i for j = 1, 2. For each X £ Pi, we shall 
choose the partitions QxY with Y adjacent to X in Gi to satisfy the properties of Corollarv 12.11 with 
fi = 2p^'^ = 2~^. Note that this is possible since we are taking M = aj and m > piUii/l > m- /2, 
so m > 2/i~^ logM, as required. 

We finish the construction of G by defining which pairs of vertices are adjacent. Vertices u,v € V are 
adjacent in G if there is i, 1 < i < s — 1, an edge {X, Y) of Gi, and j G {1, 2} with u G Xy,v G Y^. 

An equivalent way of defining the graph G is as follows. For 1 < j < i, let Gj^i denote the graph with 
vertex set Pi, where X,Y G Pi is an edge of Gj^i if there are X' ,Y' G Pj that are adjacent in Gj, and 
(i G {1, 2} with X C Xy, and Y C Yj^,. For 1 < i < s, let G* denote the graph on Pi whose edge set is 
the union of the edge sets of Gi^i, . . . , Gj-i^j. Finally, two vertices u,v £ V are adjacent in G if there 
is an edge (X, Y) of C with u G X and v ^ Y. Note that G^ is simply the empty graph on Pi. 

We say that a subset Z /3-overlaps another set X if |X n Z| > f3\Z\, that is, if a /3-fraction of Z is in 
X. A set Z is 13 -contained in a partition P of y if there is a set X G P such that Z /3-overlaps X. 

An equitable partition ^ of y is a [13, v) -refinement of a partition P of y if, for at least (1 — v)\Z\ 
sets Z ^ Z, the set Z is [1 — /3)-contained in P. In particular, when (3 = v = Q, this notion agrees 
with the standard notion of refinement. That is, Z \s a. refinement of P is equivalent to Z being a 
(0, 0)-refinement of P. 

Our main result, from which Theorem 11.11 easilv follows, now says that for an appropriate choice of 
Pi, every regular partition of G must be close to a refinement of Ps-i- In the proof of Theorem I l.H 
Theorem 13.11 will be used only in the case a = s — 1. However, for the lower bound on the strong 
regularity lemma in Theorem II. 2| we will need to apply Theorem 13.11 for various values of a. This is 
why the parameter a is introduced. 

Theorem 3.1 Let v = 3 "^'iZi Pij o.f^d suppose pi > l^^-qTiif for 1 < i < a, 1 — 2'^v > e, (3 = 2Qm^ , 

— 1/2 

6 < 13 /A, and v = bm-^ . With positive probability, the random graph G has the following property. 
Every {e, 5, r]) -regular equitable partition of G is a [(3, v) -refinement of Pa. 

3.2 Edge distribution in G 

Having defined the (random) graph G, we now show that with positive probability G satisfies certain 
properties (see Lemma |3. 11 1) concerning its edge distribution which we will use to prove Theorem ll.il 
Note that G is determined by the Gi. For some of the desired properties, it will be enough to show 
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that the edges in each Gj are sufficiently uniform. For other properties, we wih need to consider how 
the edge distribution between the various Gi interact with each other. In bounding the probabihties 
of certain events, we will often consider the probability of the event given Gi is picked at random 
conditioned on the event that Gj with j < i are already chosen. 

In the random graph G{n,p) on n vertices with each edge taken with probability p independently of 
the other edges, the expected degree of each vertex is p{n — 1), and the following simple lemma shows 
that with high probability no vertex will have degree which deviates much from this quantity. We will 
assume throughout this subsection that n > mi > 2"^^^. 

Lemma 3.1 The probability that in the random graph G{n,p) there is a vertex v whose degree satisfies 
\deg{v) — pn\ > n^/'^ is at most e~" 

Proof: For a fixed vertex v, its degree deg{v) follows a binomial distribution with parameters n — 1 
and p. Note that if \<leg{v)—pn\ > v?''^ then also |deg(i))— p(n — 1)| > r?''^ — \. From the Chernoff-type 
estimate ([T]), we get that the probability |deg(7;) —pn\ > r?''^ is at most 2e~^(" ~^^ /("-i) < le""' 
As there are n vertices, from the union bound, we get the probability that there is a vertex v with 
|deg(u) — pn\ > n^'^ is at most e"" . □ 

For X € Pj, we will use A^(^) to denote the neighborhood of X in graph Gj, that is, the set of y € Pj 
such that (X, y) is an edge of Gj. We have the following corollary of Lemma |3. 11 

Corollary 3.1 Let E\ be the event that there isi, 1 < i < s—1, such that Gi has a vertex X with degree 
\N{X)\ satisfying ||A^(X)| — pimi\ > m^ . The probability of event Ei is at most tti := YliZi e~"^i . 

Lemma 3.2 Suppose v = 'i^i^iPi < 1/2. For 2 < i < s — 1, let E2i be the event that Gi has 
less than -gPimf edges which are not edges of G^ . Let E2 be the event that none of the events E2i, 
2 < i < s — 1, occurs. The probability it2 of event E2 is at most vri + "^1=2 e"^*™"''^"^, where tti is 
defined in Corollary \3.1[ 




Proof: If event Ei does not occur, given i^ < 1/2, then the number of edges of G* is at most 

mf/2 < -mj < mf/8. 

Each of the remaining at least (™') — g^^i? > mf/3 unordered pairs of parts of Pi has probabilty pi of 
being an edge of Gj, independently of each other. The expected number of edges of Gj which are not 

2 2 2 

edges of G* is therefore at least ^^^ = ^^^ + ^^^. By ([T]), the probability of event E2i given the 
number of edges of G* is at most mf/8 is at most e~^'P'™»' ^^^ /("^i/3) = e-Pi»^i/24_ guniming over all 
i, the probability of event E2 given Ei does not occur is at most "^1^2 e"^'™*'^'*. We thus have that 
the probability of E2 is at most 7r2 . □ 
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In a graph G with vertex subsets U, W, we let dG{U, W) denote the fraction of pairs in U xW which 
are edges of G. If [/ = {u} consists of a single vertex u, we let dciu, W) = dciU, W). If the underlying 
graph G is clear, we will sometimes write d{U,W) for dG{U,W). The following lemma shows that 
there is a low probability that the density between a vertex and certain vertex subsets is large. 



Lemma 3.3 Let E3 be the event that there is i, 1 < i < s — 2, and distinct X,Y & Pi, d G {1, 2}, and 
V G Xy~ , such that [X,Y) is an edge of Gi but not an edge of G^ , and dG{v,Y^) > u. The probability 
of event E-^ is at most vra := Ylil=i S^=i+i "^i'^j^" p^mj/mi ^ 



Proof: If (X, y) is an edge of Gi but not an edge of G*, then none of the edges of G between 
Cy" and y^ come from the edges of any Gj 



Xy and Yy come from the edges of any Gj with j < i. So for event E^ to occur, there must be 



I <i < j < s-l, X,Y e Pi with (X,y) an edge of Gi, and X' e Pj with X' C X'^''^ , such that 
dG^{X\Y*) > 3pj, where Y* denotes the set of Y' E Pj with Y' CY^. 

Fix for now i, 1 < ? < s — 2, and j with i -\-l < j < s — 1. Fix also an edge {X, Y) of Gi which is 
not an edge of G* and d G {1,2}. Fix a set X' G Pj with X' C Xy"^ and as before let Y* denote 
the set of aU Y' G Pj with Y' C Y^. The probability that dGjiX',Y*) > 3pj is by ([I|) at most 
g-2(2pj|y* 1)2/11^1 = g-4p2mj/m,^ gj^^g |y*| ^ m^^ Summing over all possible choices of i, j, iX,Y), d, 
and X' G Pj with X' C Xy~ , by the union bound we have the probability of event -E3 is at most 

^-^ ^-^ ' 2mi 

i=l j=i+l 

D 

Note that the condition that {X, Y) is an edge of Gi but not of G* is necessary, since it guarantees 
that none of the edges in Gj with j < i contributes to the edges between Xy~ and Y^ in G. If 
(X, Y) was an edge of G*, then we would have a complete bipartite graph between X and Y and hence 
dGiv,Y^) = l. 

The codegree codeg(u, u) of two vertices u and v is the number of vertices w which are connected to 
both u and v. A second useful fact about G{n,p) is that with high probability the codegree of any 
two vertices u and v is roughly p'^n. 

Lemma 3.4 The probability that in the random graph G{n,p) there are distinct vertices u and v with 
\codeg{u,v) — p'^n\ > n^''^ is at most e~" 

Proof: For fixed distinct vertices u and v, the codegree codeg(ti, v) is binomially distributed with 
parameters n — 2 and p^. Note that if |codeg(M, f) — p'^n\ > ?i^'^ then |codeg(u, f) — p^(n — 2)| > 
n^/^ — 2. By Chernoff's inequality ([1]), the probability that |codeg(n, u) — p^n| > n^'^ is at most 
2g-2(n -2) /(n-2) ^ j^-2g-n _ Using the union bound over all (2) choices of u and v yields the 
result. □ 



We have the following corollary of Lemma 13. 4i 
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Corollary 3.2 Let E^ he the event that there is i, 1 < i < s — 1, such that Gi has vertices X,Y £ P^ 
with codegree satisfying \codeg{X,Y) — pfmi\ > m- . The probability of event E^ is at most 7r4 = 

For X £ Pi, let U{X) = [jyeNiX) ^ ■ "^^^ following three lemmas will be used to prove Lemma [3^ 
which bounds the probability that there isi, l<i<s — 1,XgPj, and a vertex v ^ X such that 
dciv, U{X)) > u. The proof, which puts together the next three lemmas, makes sure that it is unlikely 
that any Gj contributes too much to the density between v and U{X). 

Lemma 3.5 Fix 1 < i < s. The probability that there is a pair of distinct sets X,Y £ Pi which satisfy 
dGi{Y,N{X)) > 2pi is at most Tr^i := 26"™''". 

Proof: From Lemma 13.11 we know that |iV(X)| > pirrii — rn^ > 3pim,j/4 for all X £ Pi with 
probability at least 1 — e~"^j . Also, by Lemma |3.4| we have codeg(X, y) < pjmi + m^ < ?>pfmi/2 

_ 1/2 

for all distinct X,Y £ Pi in graph Gj with probability at least 1 — e ™j . The number of edges between 
Y and N{X) in Gi is just the codegree codeg(X,y) of X and Y in Gi. Therefore, dG,{Y,N{X)) = 

_ 1/2 

codeg(X, y)/|A^(X)|. Hence, with probability at least 1 — 2e ""J , we have 

r! (V N( Y\\ - codeg(X, Y) 3pfmi/2 _ 
\N{X)\ Spimi/4 

D 

Lemma 3.6 Fix 1 < i < s. Suppose every vertex of G^ has degree at most Uimi/2, where Vi = 
'^^j<iPj- The probability that there is a pair of distinct sets X ,Y £ Pi which satisfy dQi{Y, N{X)) > f, 

is at most vr^^ := e~"^i + mfe~'^^Pi"^^^^. 

Proof: Note that G* is determined by Gi, . . . , Gj_i. We show that, conditioning on G* has maximum 
degree at most fjmj/2, the random graph Gi is such that the probability that there are distinct sets 
X,Y £ Pi which satisfy dG.(Y,N{X)) > Ui is at most e'^^'" + m^e-'^'P?""^/^ 

Fix for now X,Y £ Pi. Let U be the neighborhood of Y in G*. By assumption \U\ < Viini/2. The 
expectation of \N{X) n U\ is at most piVimi/2. The probability that \N{X) n f7| > 3pii'imi/4: is by 
([T]) at most 

1/2 

By Lemma |3.H we know that |A^(X)| > 2>pinii/A for all X £ Pi with probability at least 1 — e"*"' . 
Therefore, using the union bound, with probability at least 

we have 

^^ ' ^ " \N{X)\ - {3p^m^/4) 

forahXyeP,-. D 
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Lemma 3.7 Fix 1 < i < j < s. Suppose every vertex of Gi has degree at least pimi/2. Let E he the 
event that there is a set X £ Pi and a set Y E Pj with Y (^ X with more than 2pj\N{X)\^- neighbors 

2 

Y' in Gj with Y' C U{X). The probability of event E is at most ir^ij := mimje~^^^^"^^ . 

Proof: The number of Y' £ Pj with Y' C U{X) is |7V(X)|^. The probabihty that a given Y has at 
least 2pj\N{X)\^ neighbors Y' in Gj with Y' C U{X) is, by (HJ, at most 

^-2(p,17V(X)l^)'/(|7V(X)|^) ^ ^-2p^lNiX)\^_ 

As there are at most rajmi such pairs X, Y, we have by the union bound, the probabihty of event E 
is at most 

D 
From the previous three lemmas, we get the next lemma. 

Lemma 3.8 Consider the event E^ that there is i, l<i<s — l,X£Pi, and vertex v ^ X with 
dG{v,U{X)) > V. The probability of event E^ is at mostn^ := T^i+Y^iZi '^5i+Y^iZ2 '^'5i+Y^i<i<:j<:s '^5ij ■ 

Proof: We look at edges in G between U{X) and Y for X £ Pi and Y G Pj, distinguishing three 
different cases, namely j = i, j < i and j > i- For event E^ to occur at least one of the following 
events occurs: 

• There is 1 < i < s and distinct sets X,Y £ Pi with dG^(Y,N{X)) > 3pi. 

• There is 1 < i < s and distinct sets X,Y £ Pi which satisfy dQi{Y,N{X)) > Vi = J2j<i'^Pj- 

• There is 1 < i < j < s and sets X £ Pi and Y £ Pj with Y (;t X with dc^ (Y, U{X)) > 3pj. 

The first case is covered by Lemma |3.5| the second by Lemma 13.61 and the third by Lemma 13.71 For 
Lemmas 13.61 and 13.71 to be applicable, it is enough to know also that for any i and any X £ Pi, 
\N{X) — piTUil < m^ < piVii/A. But this is just the event that Ei does not occur. From Corollary 
13.11 we know this holds with probability at least 1 — tti. 

Therefore, putting everything together, the probability of event E^ is at most 

s-l s~l 

i=l i=2 l<i<j<s 

D 

Lemma 3.9 Fix 1 < i < s — 1. Let E^^i be the event that there is X £ Pi such that X has more than 
z^|A^(X)| neighbors Y in G* with Y £ N{X). Let Eq be the event that at least one of the events Eg^i 
occurs for 1 < i < s — 1. Then, the probability of event Eq is at most ttq := tti + X]i=2 "^je"^'^ Pimi/8_ 
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Proof: Let us assume that event Ei does not occur. Then every vertex in Gj has degree at least 
jPiTrii. Moreover, for every X € Pi, its number of neighbors in C is at most 

1/4 \ ,5 ,1 




12 2 ^ ' 

Furthermore, the graphs Gi and G* are still independently chosen given the degree conditions im- 
posed on them by Ei not occuring. Fix for now X ^ Pi. Then, the expected fraction of elements 
of N[X) which are neighbors of X in G* is at most u/2. Therefore, given |A^(X)|, the probabil- 
ity that the number of neighbors of X in G* is more than z^|A^(X)| is, by ([1]) and the fact that a 
hypergeometric distribution is at least as concentrated as the corresponding binomial distribution, 
at most e~2{^l^(^)l/2)V|A'(^)l = f.-''^\N{x)\/2 ^ gy ^j^g ^xaion bound and the assumption that the de- 
gree of X in Gi is at least jpirm, the probability of event £"6,1 given that Ei does not occur is at 
most mie~^'^ Pimi/8_ Therefore, adding over all i, we get that the probability of event Eq is at most 
TTi + X]i=2 "^je"^*^ PiTrii/s = 7,-g^ as required. □ 

Lemma 3.10 Let Ej be the event that there is i, \ < i < s — 1, and vertex subsets Ui, U2 C Pi of Gi 

with \Ui\ = ui, IC/2I = U2! <ind ui < U2 such that 

\e{Ui, U2) - PiUiU2\ > 5Vh, (7) 

where h = h{ui,U2) = ^1^2 In ^^^. The probability of event Ej is at most vry := X]|=i 2?7i~ . 

Proof: By Lemma l2.6| for each i, the probability that there are subsets Ui,U2 C Pi such that ([7]) 
fails is at most 2m^ . By the union bound, the probability of event Ej is at most "^iZ^ 2mJ . □ 

We gather the previous lemmas into one result, which shows that with positive probability the edge 
distribution of G has certain desirable properties. 

Lemma 3.11 Suppose v = 'i^iZiPi — 1/2- With probability at least 1/2, the graph G has the 
following properties for all i, 1 < i < s — 1. 

• The degree of every vertex in graph Gi differs from piUii by at most m-J and the codegree of 
every pair of distinct vertices differs from pfnii by at most m^ . 

• The number of edges of Gi not in G* is at least pimf/A. 

• For all X £ Pi and vertex v ^ X , we have dciv, U{X)) < v. 

• For all distinct X,Y £ Pi, d G {1,2}, and v G Xy~ , such that {X,Y) is an edge of Gi but X 
and Y are not adjacent in G*, we have dQ{v,Y^) < u. 

• For all X e Pi, X has at most u\N{X)\ neighbors Y in G* with Y e N{X). 
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• For all vertex subsets C/i, f/2 C Pi of graph Gi with \Ui\ = ui, IC/2I = '>J'2: o,nd ui < U2, 

|e(C/i,t/2) - PiUiU2\ < 5Vh, 
where h = h{ui,U2) = uiv^ In ^^. 

Proof: By Corollaries 13.11 13.21 and Lemmas 13.21 13. 3^ 13. 8^ 13. 9^ 13.101 and the union bound , the probability 
that at least one Eh, I < h < 7, occurs is at most S/i=i Pr[-E'/i] < J2h=i'^h- Using the estimates 
p = 2-20, jni > 2^00^ rrir = mr-iOr-i for 2 < r < s, where ar-i = 2^''""^-°^, and pi > mj'''^^ for 
1 < r < s — 1, it is easy to verify that each vr/i < 1/14 and hence the probability that none of these 
events occur, i.e., G has the desired properties, is at least 1/2. D 

For the rest of the proof of Theorem 13. H we suppose that G has the properties described in Lemma 

Em 

3.3 Regular partitions are close to being refinements 

— 1/2 C 

Let 6 = m-^ , C, = uj = 209, /3 = -^, and 7 = 1 — oj. Suppose for contradiction that there is an 
equitable partition Z : V = Zi U . . . U Z^ of the vertex set of G such that all but at most rjk'^ ordered 
pairs {Zj,Z() of parts are (e, (5)-regular, but Z is not a (/?, f )-refinement of Pa- 

The two main lemmas for the proof are Lemmas 13.141 and 13.151 which show that if Zj satisfies certain 
conditions, then there are at least d~^r\k pairs (Zj,Z() that are not (e, (5)-regular. The rest of the 
proof. Theorem 13.21 shows that there are at least Qk Zj which satisfy the conditions of Lemmas 13.141 
or l3.151 Together, we get at least 9~^rjk ■ 6k = r]k^ ordered pairs {Zj,Zi) which are not (e, (5) -regular, 
which completes the proof. 

Since Pi is a partition into mi parts, then, by the pigeonhole principle, each Zj is :^ — contained in Pi. 
We call Zj ripe with respect to r if Zj is /3-contained in Py. but not (1 — /3)-contained in P^. That is, 
Zj is ripe if there is X G P^ containing a /3- fraction of it but no X € P^ containing a (1 — /3)-fraction 
of it. Let ip = 2^^ p. We call Zj shattered with respect to r if Zj is (1 — /3)-contained in P^, but at least 
a ^-fraction of Zj is contained in subsets Xr]Zj with X S Pr+i and \Xr\Zj\ < /3\Zj\. The sense here 
is that Zj is shattered by the partition P^+i if Zj is almost completely contained in some X € P^. but 
it is not well-covered by Pr+i. 

We say that a subset X G V {f3,-y)- supports the partition Z if at least a 7- fraction of the elements 
of X are in sets Zj which /3-overlap X. That is, a 7- fraction of the elements of X are in sets Zj for 
which \XnZj\ >fi\Zj\. 

Lemma 3.12 Each of the mi sets in the partition Pi {f3, 1 — j3mi) -supports Z. 

Proof: Let X G Pi. At most a /3- fraction of V is in sets of the form X n Zj with \X r\ Zj\ < fi\Zj\. 
Hence, as \X\ = \V\/mi, at most a /3?7t,i -fraction of X belongs to Zi which do not /3-overlap X. □ 

CI 

Let Si denote the set of X G Pj which (/3, 7)-support Z. We will let Kj = \-p\. Let Wi denote the set 
of X G Pi for which \N{X) n 5,| < Ki\N(X)\/A. 
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Lemma 3.13 For 1 < i < s — 1 with ki > lOOp^ '^m- '^ln(mie), we have \Wi\ < lOOp^ ^ ln(K^ ^e). 

Proof: In graph Gi, the number e{Wi, Si) of pairs in Wi x Si which are edges is at most kj/4 times 
the sum of the degrees of the vertices in Wi. Since, by Lemma 13.111 every vertex has degree at most 
2pi\Pi\ in Gi, we have e{Wi,Si) < \Wi\ ■ (Ki/4) • 2pi\Pi\ = pi\Si\\Wi\/2. Hence, by Lemma IHTTT] 

Pi\S,\m\/2 < \e{W„Si) -p^mWSiW < 5Vh, 

where h = liiUgln^^, and ui = min(|VFj|, |5j|) and U2 = max(|Wi|, |5j|). By squaring both sides, 



substituting uiU2 = \Wi\\Si\ and simphfying, we have ui < WOp^ hi^^iS. If ui = |5'j| = Kirui, then 

Kiirii = ui < 100p^ In < lOOp^ In(mie), 

U2 

contradicting our assumption. Hence, ui = \Wi\, and \Wi\ < 100p~ ln(K~ e). □ 

The following simple proposition demonstrates the hereditary nature of supporting sets. 

Proposition 3.1 Suppose Y & Pi is such that Y (13, j)- supports the partition Z. Then, for each 
X £ Pi distinct from, Y and d G {1, 2}, Y^ (/3/4, 1/ A) -supports the partition Z. 

Proof: We will use the fact 7 > 7/8. The sum of |.^( n Y^| over all Z^ which /3-overlap Y but do not 
/3/4-overlap Y^ is at most |y|/4. Since Y (/3, 7)-supports the partition, the sum of \Zt n Y^| over all 
Zi which /3/4-overlaps Y^ is at least 

|yil - (1 - 7)|>^| - 11^1/4 > |y|/8 = |yi|/4. 

Hence Y^ (/3/4, l/4)-supports the partition Z. □ 

The following lemma shows that if Z^ satisfies certain conditions, then there are many (at least 6~^r]k) 
Zi such that {Zj,Zi) is not (e, (5)-regular. 

Lemma 3.14 Suppose X £ Pi\ Wi, Ki = \Si\/\Pi\ > 1/2, Zj is shattered with respect to i, and Zj 
(1 — (3)-overlaps X. There are at least 6~^r]k sets Zg £ Z for which {Zj, Zi) is not (e, 5)-regular. 

Proof: Since Zj is shattered with respect to i and Zj (1 — /3)-overlaps X, then \Xr\Zj\ > {1 — /3)\Zj\, 
but the sum of \X' Zj\ over all X' G Pi+i with \X' D Zj\ < (3\Zj\ is at least ip\Zj\. 

Let Z'j = Xn Zj, so \Z'j\ > (1 - /3)\Zj\. For each X' £ Pj+i with X' C X and \X' n Zj\ < ^\Zj\, let 

\x' = \X' ^Z'-\/\Zj\, so each \x' < /3, i.e., /3 > ||A||oo- Also, ||A||i > il^ — (3 follows from the facts that 

\X nZj\> {1- /3)\Zj\ and the sum of \X' D Zj\ over all X' £ Pj+i with \X' D Zj\ < /3\Zj\ is at least 

ip\Zj\. Therefore, 

..^IIAIby.llAlloo^ /3 1 ^ .^19 

"" "' ' - <^_/3-220-i <^ • 
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By Lemma 12.31 with a = 1/8, /x = 2p^''^ = 2 ^, cr < 2 ^, and r = 1 — 2 ^, we have that the number 
of y G N{X) for which 

\Zjr\X].\,\Zjr\Xl\ > a||A||i|Zj| >a{il;-p)\Zj\ (8) 

is at least (1 — 2~^)\N{X)\, where N{X) is the neighborhood of X in graph Gj. 

By Lemma |3. 11 1 the number of y G ^{^) which are also adjacent to X in G* is at most !/|A^(X)|. 
Also, since X Wi, we have \N{X)r\Si\ > Hii\N{X)\/A. Therefore, the number of Y G Sj with {X,Y) 
an edge of Gi but not an edge of G* , and Q is satisfied is at least 

(1 - 2-5)|iV(X)| - \N{X) \ 5,1 - u\N(X)\ > {Ki/4. - 2"^ - iy)\N{X)\ > |A^(X)|/16. 

Fix such a Y, and let Ud = Zj n X^ for d G {1,2}, so \Ui\, IC/2I > a{i; - P)\Zj\. Since Y G S",, we 
have Y (/?, 7)-supports Z. By Proposition 13.11 Y^ (/3/4, l/4)-supports Z. By Lemma [3.111 for each 
vertex v G Xy~ , we have d{v,Y^) < v. In particular, d{U-i-.d,Y^) < z^. Let i?'^ be the union of all 
Z^ n Y^ such that Zi /3/4-overlaps Y^, so i?'^ is a subset of Y^ of cardinality at least |Y^|/4. Hence, 

d(f/3-d,/?'^)<4z.. 

For Zi which /3/4-overlaps Y^, let Z[ = W^CiZi, so \Z'f,\ > (3\Zi\/4. We next show that there are many 
Z'^ which satisfy 

d{Us-d,Z',)<8i^. (9) 

Indeed, the union of the Z'^ which do not satisfy ([9]) has cardinality at most ^l-R^^I, so at least 1/2 of 
W^ consists of the union of Z'^ which satisfy Q . The number of i which satisfy ([9]) is at least 

l\R''\/\z,\ > i(|yi|/4)/|z,| 

= ^\Y\/\Z,\ = ^k/m„ 

where in the last equality we used \Y\ = \V\/mi and \Zi\ = \V\/k. 

For each Z^ which satisfies ([9|), we have d{Ud, Z'^) = 1 since {X, Y) is an edge of Gj and, therefore, the 
density of edges between Xy and Y^ is 1. Hence 

d{Ud, Z[) - d{U3^d, Z[)>l-8v> e. 

Since also \Ud\, \U^-d\ > Oi{ip — P)\Zj\ > S\Zj\, and \Z'^\ > j\Zi\ > 6\Z(\, we have in this case (Zj, Z() 
is not (e, 5)-regular. 

Since the number of such Y is at least |A^(X)|/16, we have that the number of pairs {Zi,Y^) such 
that Zi /3/4-overlaps Y^ and {Zj,Ze) is not (e, 5) -regular is at least {^k/rrii) (|A'"(X)|/16) > 2^^pik, 
where we used |A^(X)| > ■^Pi'^^i from Lemma 13.111 As Zi /3/4-overlaps Y^ in each such pair, a given 
Z^ is in at most Aj3~^ such pairs. Hence, the number of Zi for which (Zj, Zf) is not (e, 5)-regular is at 
least 2-i^/3pjA; > O^'^rik. D 

Like Lemma |3.14| the next lemma shows that if Zj satisfies certain conditions, then there are at least 
9^^-qk Zi such that [Zj^Zi/) is not (e, (5)-regular. 
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Lemma 3.15 Suppose X G Pj\ Wj, Kj > 1/2, Zj is ripe with respect to i, and Zj (5-overlaps X. Then 
there are at least O^^rjk sets Zi £ Z for which (Zj,Zi) is not (e, 5) -regular. 

Proof: Since Zj is ripe with respect to i, \X Ci Zj\ < (1 — /3)\Zj\. Therefore, letting U' = Zj \ X, we 
have \U'\ > f3\Zj\. 

By Lemma [3. Ill for each vertex u of G which is not in X, we have d{v, U{X)) < v. Since X Wi, we 
have 

|A^(^) n Si\ > K^\N{X)\/4 > \N{X)\/8. (10) 

So 

d{v, (J Y)< 8i/. (11) 

YeN(X)nSi 

Fix for this paragraph Y G N{X) D Si. Since Zj /3-overlaps X, there is d = d{j, Y) G {1, 2} such that 
Zj /3/2-overlaps X^. Let Uy = Zj n X^, so \Uy\ > §\Zj\ and d{UY,Y^) = 1. As F G Si, we have Y 
(/3, 7)-supports Z. By Proposition 13. H Y^ (/3/4, l/4)-supports Z. 

For Y G N{X) n Si, let Ry denote the set of vertices y which are in Y^ with d = d{j, Y), and y is also 
in a Zi which /3/4-overlaps Y^, so 

1^1 \V\ 

\Ry\ > -\Y^\ = -\Y\ = ^. (12) 

Let R = [jyeN{X)ns, ^Y- We have 

\R\ > \N{X) n Si\^ > 2-6|Ar(x)|I^ > 2-6^!^ . M = 2~'pi\V\, (13) 

STTij rui 2 mi 

where we used (^, ([lOD, and |iV(X)| > Pimi/2. 
By ([II]) and ([El), we have for v ^ X, 

d{v,R)<2^v. (14) 

By (dl]), we have d{U',R) < 2^u. For Z^ which /3/4-overlaps Y^ for some F G N{X) n 5^ and 
d = d{j, Y), let Zf = Zir\Y^, so \Zj\> (/3/4)|Z^|. By definition, for each Y G iV(X) n 5i, iJy is the 
union of the sets Zj . We next show that there are many Zj which satisfy 

(i(C/',Zf) <2V (15) 

Indeed, the union of the Zj which do not satisfy p^ has cardinality at most \\R\, so at least 1/2 of 
R consists of the union of Zj which satisfy (|15|) . The number of pairs {i,Y) which satisfy (|15p is at 
least 

\\R\/\Z,\ > ^2~'p^\V\/\Zi\ = 2-^p,k. 
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where we used (J13p and \Z(\ = |1^|/A;. Since for each such £, we have Zi /3/4-overlaps Y^, each such i 
is in at most 4/3^-'^ of the pairs (£, 1") we just counted. Hence, the number of t for which there is Y 
such that (jlSp holds is at least 2~^^Ppik. 

By ([15]) and c?(C/y, Zf) = 1, we have 

d{UY, Zj) - d{U', Zj) > 1 - 2^zy > e, 

and as |?7y|, \U'\ > 2\^j\ — ^\^j\ ^^'^ \^e\ — fl^^l — ^\^i\i '^^ have that {Zj,Zi) is not (e, (5)-regular 
for at least 2-i°/3pi/c > 6I-I77A: values of £. D 

The following theorem completes the proof. 

Theorem 3.2 The number of ordered pairs {Zj, Zg) which are not (e, 6) -regular is at least rjk'^. 

Proof: By assumption, Z is not a (/5, i;)-refinement of Pa- Hence, the number of parts Zj of partition 
Z which are not (1 — /3)-contained in Pa is at least vk. Let zq be the minimum positive integer for 
which Pig is not a (/3, t;)-refinement of Z. As, by assumption. Pa is not a (/?, t;)-refinement of Z, we 
have 1 < io < a. 

Claim 3.1 We have ki = 1 and Ki > 1/2 for i < zq. 

As /3 = Q/mi, by Lemma |3.12| each of the mi parts of partition Pi (/3, 1 — ^)-supports Zj. As C, = u 
and 7 = 1 — w, it follows that Si = Pi and ki = \Si\/\Pi\ = 1. From the definition of zq, for each i < iq, 
Pig is a (/3, t;)-refinement of -H. Fix for this paragraph such an i < zq. Hence at most a (/3 + r;)-fraction 
of the vertices are in parts Zj n X with X £ Pi and Zj £ Z and \Zj riX\ < {1 — P)\Zj\. In particular, 
as 1 — /3 > /3 and 7 = 1 — 0;, the fraction of X £ Pi which do not (/3,7)-support Z is at most B^. 
Hence kj > 1 — ^-^ > 1/2, which completes the proof of Claim [STTl 

Consider the partition Z = Z^ U Z'^ U Z^ U Z"^ U Z^ U Z^, where Zj G Z if h is minimum such that 
Zj satisfies property h below. 

1. There is i < io and X £ Pi\ Wi such that Zj is shattered with respect to i and (1 — /3)-overlaps 
X or if Zj is ripe with respect to i and /3-overlaps X, 

2. For every X G Pi such that Zj /3-overlaps X, X £ W\. 

3. There is i, 1 < i < zqi and X £ Wi such that Zj /3-overlaps X, 

4. zq > 1 and Zj is ripe with respect to io, and there is X G Wj^, such that Zj /3-overlaps X. 

5. Zj is ripe with respect to ioi and there is X G Pjq \ Wi^^ such that Zj /3-overlaps X. 

6. Zj is (1 — /3)-contained in Pjq. 

It is not immediately obvious that the above six subfamilies of Z form a partition of Z, so we first 
show that this is indeed the case. 
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Claim 3.2 The above six subfamilies form a partition of Z. 

As Zj G Z'^ if and only h is the minimum such that Zj satisfies property /i, the subfamihes Z^, 
1 < /i < 6, are pairwise disjoint. We thus need to show that each Zj is in at least one Z^. Suppose 
for contradiction that Zj is in none of the Z . By property 6, Zj is not (1 — /3)-contained in Pjq. If Zj 
is /3-contained in Pi^ , then Zj is ripe with respect to io , and there is X G Pi^ such that Zj /3-overlaps 
X. Either every such X € Wi^ or there is such an X ^ ^ioi ^^^ by properties 2, 4 and 5, we must 
have in this case Zj is in a Z^ for some h < 5. So Zj is not /3-contained in Pj^, and noting that 
every Zj is /3-contained in Pi, we must have Zj is ripe or shattered with respect to at least one i with 
1 < i < if). In particular, there is i < iq and X £ Pi such that Zj is shattered with respect to i and 
(1 — /3)-overlaps X or Zj is ripe with respect to i and /3-overlaps X. Since Zj ^ Z^, for every such 
i < io and X £ Pi, we must have X £ l^j. But then Zj G 2^^ or ^'^. Thus Zj is in at least one of the 
six subfamilies, completing the claim that that these subfamilies indeed form a partition of Z. 

As the number of parts Zj of partition Z which are not (1 — /3)-contained in Pj^ is at least vk, we have 

\Z\Z^\>vk. (16) 



Let Wi = \Wi\/\Pi\. By Claim [3?T| ki = 1 and kj > 1/2 for i < zq. Hence, from Lemma 13.131 we have 

/i<i<io ^« - "^2" • Here we used pi > m^" 



wi < lOOp]^ m^^ ln(2e) < m^ and similarly w := X]i<i<i ^« — "^2 • Here we used Pi> m^ 



mi > 2^00, and m^ > 2™'-i. 

We next bound the size of Z'^ . If Zj 6 2^^ , since Zj does not /3-overlap any X £ Pi\ W\ , and | Pi | = ttt-i , 
then at least a (1 — /3?TT-i)-fraction of Zj is contained in sets X € Wi. Hence, the fraction of Zj G Z 
which satisfy Zj E Z"^ is at most (1 — f^m\)~^w\ < 2m^ , i.e., |-Z^| < [2m^ )/;:. 

Similarly, the fraction of Zj £ Z such that there is 1 < i < io and X G Wi for which Zj /3-overlaps X 
is at most (3~^m2 ■ Hence, \Z'^\ < /3~^?n,2 A:. 

By Lemmas 13. 141 and 13. 151 each Zj G 2^^ is in at least 6~^r]k pairs {Zj, Zi) which are not (e, (5)-regular. 
We are thus done if \Z^\ > 6k. So we may suppose \Z^\ < Ok. 

We next give a lower bound on Kj^. 

Claim 3.3 We have Ki^ > 1/2. 

Note that if zq = 1, by Claim \3A\ ki = 1. So we may suppose that io > 1. In order to give a lower 
bound on Kjp, we next give an upper bound on the union of all sets Zj D X with |Zj n X| < /3|Zj| and 
X G Pj„. If Zj is not (1 — /3)-contained in Pj^, then it must be shattered or ripe with respect to some 
i with i < io, 01 must have at most V'l-^jl vertices in parts X n Zj with X £ Pi^ and \X f] Zj\ < l3\Zj\. 
Each Zj which is shattered or ripe with respect to some i with i < io is in Z^, Z'^ , or Z^ , and hence 
the number of such Zj is at most 

\Z^ U ^2 u 2:^1 < 0A; + {2m{^l'^)k + /3~ V~^/^A;. (17) 

Every set Zj which (1 — /3)-overlaps Pj,, has at most a /3-fraction of it contained in sets X n Zj with 
\X CiZjl < P\Zj\ and X £ Pi^. In total, we get that the fraction of vertices which belong to one of the 
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sets X n Zj with \X r\ Z j\ < j3\Z j\ and X G Pj^ is at most 

e + {2m-^'^) + /3-im~^/^ + /3 + V'- 

The fraction of sets in P-^^ which do not (/3, 7)-support Z is therefore 

1 - Kio < w^i (e + (2m^^/^) + /?^im^^/^ + /3 + V') < 1/2. 

Hence, kj^ > 1/2, which completes Claim [3T3l 

Noting that Kjq > 1/2, the same argument that bounded \Z'^\ also gives that 

\Z^\ < r^m^^^'^k. (18) 

From the bounds p^ . (fT7|) . (fTSj). we have 

\Z^\>\Z\Z^\- \Z^ UZ^U Z^\ - l^^l >vk- (ok + 2m:[^''^k + /3-^m^^/^A;) - /3-^m^^/^A; > 9k. 



As Kjo > 1/2, by Lemma |3.15| each Zj € Z^ is in at least 9 ^rjk pairs {Zj^Z^) which are not (e, 5)- 
regular. Hence, the number of irregular pairs is at least 

\Z^\9-^'qk >r]k'^, 

which completes the proof. □ 

3.4 Proof of Theorem [TT] 

To prove Theorem ll.il it suffices to prove the fohowing theorem. 

Corollary 3.3 Let e = 1/2, 6 = 2"''°°, r] < 2~™^ , s = [2~^^°t]~^\, and k be at most a tower of twos 
of height s. There is a graph G = iy, E) for which any equitable partition ZofV into at most k parts 
has at least rjk"^ ordered pairs of parts which are not {e, 6) -regular. 

Proof: Let mi = 2^'^'^ and pi = max(?nj ' , 2^'^^?/) for 1 < i < s — 1 and consider the graph G given 
with positive probabihty by Theorem 13.11 As u = S^j'Z^ pi, we have 

1^ < 3 |](m-^/^° + 2^00^) ^ 3 . 2500^(^ - 1) + 3 1] mT'/'' < 3 • 2-1°° + 3^pi < 6pi < 2-1°, 

SO 1 — 2^1/ > e. The first inequahty uses that the maximum of two nonnegative real numbers is at most 
their sum. The second inequahty uses s = [2^^^^r]~^\ and the fact that the sum of tji^ rapidly 

1 -1/10 O-20 

converges, and pi = m^ = I . 

2^ 1/2 

Note that as mi = 2"^^^ > 2^ and ?nj > 2™^-! for i > 1, we have |Pj| = m,j is greater than a tower 
of twos of height i + 2 for 1 < i < s. By Theorem 13.11 with a = s — 1, any (e, 5, ?/)-regular equitable 
partition of G is a (/3, t!)-refinement of Pg-i- In particular, at least one part of Ps-i contains at least 
a (1 — /3)-fraction of a part from Z. As 1 — /3 > 1/2, this implies \Z\ > gl-P^-il > k, which completes 
the proof. □ 
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4 Lower bound for the strong regularity lemma 

In this section we prove Theorem 11.21 which gives a lower bound on the strong regularity lemma and 
states the following. Let < e < 2"^'^'' and / : N — > (0, 1) be a decreasing function with /(I) < 2~^^^e^. 
Define Wi recursively by Wi = 1, Wi^i = T (2~'^^ e^ / f (W^)^ , where T is the tower function defined 
in the introduction. Let W = Wt-i with t = 2~'^^e~^. Then there is a graph G such that if equitable 
partitions A,B of the vertex set of G satisfy q{13) < q{A) + € and B is /(|^|)-regular, then |^|, \B\ > W. 

We next describe the proof of Theorem 11.21 In the first subsection, we construct the graph G as a 
specialization of the construction in Theorem 13.11 The graph G we use to prove Theorem 11.21 has 
vertex partitions Pij with 1 < i < t, and 1 < j < hi satisfying Pi'j' is a refinement of Pij if i' = i 
and f > j or if i' > i. Furthermore, as the number of parts in each successive refinement is roughly 
exponential in the number of parts in the previous partition, we show in the first subsection that 
\Pt-2,ht-2-2\ ^ W- The edges of G are defined based on certain graphs Gij on Pjj. In Subsection 14. 3t 
we prove a lemma which implies that the construction has the property that 

q{Pi,hJ>q{P,^h,_2) + 2e (19) 

for each i < t. 

Let A and B be equitable partitions of the vertex set of G such that q{B) < q{A) + e and B is /(|^|)- 
regular. Let Mi = 1 and Mi = \Pi-i^hi_i~2\ for 1 < ^ < t — 1. Let r with l<r<t— Ibe maximum 
such that 1^1 > Mr. Let P' = Pr,hr-2 and P = Pr,hr- ^^ Subsection 14. H after defining G, we show 
that it satisfies the hypothesis of Theorem 13. H and conclude that, as B is an /(|^|)-regular partition 
of G and / is a decreasing function, it must be close to being a refinement of P. It follows that if 
1^1 > -Mt„i = \Pt-2^ht-2-2\ > ^) then \B\ > W as well, and we are done in this case. Thus we may 
assume |^| < Mt-i and hence r <t — 2. In Subsection 14.41 we prove 

qiA) < q{P') + |. (20) 

This follows from a lemma that states that q{P') is close to the maximum mean square density density 
over all partitions of the same number of parts as P' . In Subsection 14.21 we use the result that B is 
close to being a refinement of P to conclude 

q{P)<q{B)+'-. (21) 

Putting the three estimates (fT9ll (with i = r and noting in this case Pi^hi = P-, Pi,hi-2 = P')-, (!20|l . 
(pT]) together, we get that 

q{B) > q{P) -'-> q{P') + 2e - | > q{A) + e, 

contradicting the hypothesis of Theorem 11.21 and completing the proof of Theorem 11.21 D 

4.1 Construction of G and proof that B is an approximate refinement 

We will construct the graph G as a special case of the construction in Theorem 13. 1[ 
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Let t = 2~^e~ . We have partitions Pij of the vertex set V for I < £ < t and 1 < j < hi, where hg 
is defined later in the paragraph and Pij = Pi are the partitions used to construct G in Theorem 13.11 
with i = j + "^cKi^d- We set mij = \Pij\ = \Pi\ = rrii, and pij = pi. As above, let Mi = 1 and 



Mf 



1/10 o30. 



mi_i^hi_i~2 for 1 < £ < t. Let e^ = /(M^), h^ = 2^, and pij = nmx{m^j' ,2'^"e ''e^) for 



1/10 o30^-4^„ nlO, 



'i- < j < he with j y^ hi- 1, and P£,/i^-i = max(m^_^. , 2'^"e ^e^, 2^"e). 

Let mi = 2^'^e~^, so ttii > 2^'''^. Note that, as each mej is exponential in a power of m£j_i, we get 
that M^ is at least a tower of 2s of height h^. That is, M^ > T (2~'^''e^//(M^„i)). In particular, by 
induction. Mi > Wi, where Wi is defined earlier in this section. 

We will apply Theorem 13. II to conclude the following corollary which states that any sufficiently regular 
partition of G is roughly a refinement of a particular Pej. To accomplish this we need to show that 
the conditions of the theorem hold, which we postpone until after stating the following corollary. We 
fix G to be a graph satisfying the properties of Lemma 13.111 so that if G also satisfies the conditions 
stated in Theorem 13. H then it satisfies the conclusion of Theorem 13.11 

Corollary 4.1 Let r < t — 1 be the maximum positive integer for which \A\ > Mr, so /(|^|) < 
f{Mr) = er, and P = Pr,hr- '^he partition B, which is e^-regular, is a [(5, v) -refinement of P with 
/3 = 20777.]^ and v = hm^ . 

Note that 

i=i 1=1 j=i e=i j=i 

< 2'^et + Y, <'^'° + E E 2'°e"'e^ < 2'°e* + E <'^'° + E 2~'°e < 2-', 

i=l £=1 j=l i=l i=l 

where we used that the maximum of a set of nonnegative numbers is at most their sum, and substituted 
in hi = 2^, mi = 2i0e-2 > 2200^ ^,^+1 = m^iUi > mi2L2"'°'^?'''°J, and t = 2-'^°e-K We thus have 
1 — 2^z^ > 1/2 > er- Notice if r/ = e^ = f{Mr), then, for 1 < i < r and 1 < j < hi, we have 

P^,, > 2=^°e-^e, > 2''e-% = 2''r,ml 

where we used mi = 2^^e~'^. Since (3 = 20m^^^^ = 20 • 2~^^e^ and /(I) < 2~^^^e^, we have 6 = er = 
f{Mr) < f{Mi) = /(I) < /3/4. 

By the above estimates, the conditions of Theorem 13.11 are satisfied, and Corollary 14.11 stated above 
indeed holds. □ 

Note that if r = t - 1 in Corollary gH then |^| > Mt_i = \Pt~2,ht-2~2\ > W, and B is a {f3,v)- 
refinement of P = Pr,hr- As 1 — /3 > 1/2, this implies \B\ > gl-^r./irl > ^' which completes the proof 
of Theorem 11.21 in this case. We can therefore assume r < t — 1. 

4.2 Approximate refinements and mean square density 

From Corollary 14.11 and the following lemma, we deduce at the end of this subsection that if P is the 
partition in Corollarv l4.1l then q{P) < q{B) + |. 
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Lemma 4.1 Suppose G is a graph, P is a vertex partition, and Q is an equitable partition which is a 
{j3,v) -refinement of P. Then q{P) < q{Q) + 2(3 + \v. 

Proof: Let Q' be the common refinement of P and Q, so q{Q') > q{P)- Let X,Y ^ Qhe such that 
X, Y are each (1 — /3)-contained in P. Let X = Xi U . . . U X^- be the partition of X consisting of parts 
from Q' with \Xi \ > (1 — /3) |X| , and y = Yi U . . . U Kj be the partition of Y consisting of parts from Q' 
with inl > (1 - /3)|y|. Let p = d{Xi,Yi) and p' = j^^Zd{Xi,Yj)piqj, where p, = ^, qj = ^ 
and the sum is over all pairs {i,j) G [r] x [s] except {i,j) = (1, 1). That is, p' is the weighted average 
edge density between the pairs of parts except (Xi, Yi). We have 

r s 

X] ^ d^i^i^ ^j)Pi(lj < P^PiQi + Yl d{Xi,Yj)piqj = piqip^ + p{l - piqi) 
«=i i=i (ij)7^(i,i) 

and 

d{X,Y) =ppiqi +p'{l -piqi). 

Let e = 1 — piqi, so 

r s 

= e ((1 - e)p2 + p' _ 2pp'{l - e) - p'^e) 

< e((l-e)p2+p'-2pp'(l-e)) 

< e 

< 2/3. 

The second to last inequality is by noting the right hand side of the third to last line is linear in p' 
and must therefore be maximized when p' = or 1, and the last inequality follows from e = 1 — piqi 
and pi,qi > 1 — /3. 

Now for parts X,Y £ Q that are not both (1 — /3)-contained in P, again letting X = Xi U . . . U Xr 
and y = li U . . . U y^ be the partitions of X and Y consisting of parts from Q', and letting q denote 
the edge density between X and Y, and pi = Wr, qj = -U^, we have 

r s 

Y, Y. d\X„Y^)p,qj - d\X, Y)<q-q^< 1/4. 
i=i i=i 

Since Q is a (/3, t;)-refinement of P, at most a 2i;-fraction of the pairs of parts from Q are such that not 
both parts are (1 — /3)-contained in P. Putting together the estimates from the last two paragraphs, 
we therefore get 

q{P)<q{Q')<q{Q)+2(3 + ^-2v. 

D 

Noting that mi = 2^'^e~^, /3 = 20mi < e/8 and v = Sm^ < e/4 in Corollary 14. H we have the 
following corollary of Corollary 14.11 and Lemma 14. 1[ 
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Corollary 4.2 If P is the partition in Corollary \4-l\ then q{P) < q{13) + |. 

4.3 Mean square densities of the defining partitions 

The next lemma shows that the mean square density of each successive partition increases by a constant 
factor of the edge density of each Gj. 

Lemma 4.2 For each i, we have q{Pi+i) > q{Pi) + 2~^pi. 

Proof: The fraction of pairs {X, Y) ^ PiX Pi which are edges of d and not edges of G* is at least pi/A 
by the second property in Lemma 13.111 For each such pair, the equitable partitions X = Xy U Xy, 
y = y| U y| satisfy diX^,Y^) = 1 and d{X^,Y^-'^) < i/ < 1/4 for d = 1,2. Let di = d(Xi.,y|) 
and d2 = d{XY , Y^ ) , so 

f f id^W, yi) - d'^ix, y ) = i + ii±^ - f i + *±* V > i - <i?i±i« > i 

i=i j=i ^ ^ 

As we get this density increment for at least a pj/4-fraction of the pairs {X,Y) G Pj x Pj, we get a 
total density increment of at least g^ = 2~^pi. D 

We have the following corollary noting that pr^h^-i ^ 2^'^e. 
Corollary 4.3 For P = Pr,hr o^^d P' = Pr,hr-2, we have 

q{P) = q{PrM.r) > q{Pr,hr-i) + 2-^p.,h,._i > q{Pr,hr~i) +2e> q{P') + 2e. 

4.4 Quasirandomness and mean square density 

The goal of this subsection is to show that if ^ is a vertex partition of G with |^| < \Pi\, then ^'(.4) 
is at most q{Pi) + Pi plus a small error term. To accomplish this, we show that the graphs used to 
define G are quasirandom with small error. 

The study of quasirandom graphs began with the papers by Thomason [32] and Chung, Graham, and 
Wilson [H]. They showed that a large number of interesting graph properties satisfied by random 
graphs are all equivalent. These properties are known as quasirandom properties, and any graph that 
has one of these properties (and hence all of these properties) is known as a quasirandom graph. 

This development was heavily influenced by and closely related to Szemeredi's regularity lemma. 
Furthermore, all known proofs of Szemeredi's theorem on long arithmetic progressions in dense subsets 
of the integers use some notion of quasirandomness. For graphs on n vertices with edge density p 
bounded away from zero, the following two properties are quasirandom properties. The flrst property 
states that the number of 4-cycles (or, equivalently, the number of closed walks of length 4) in the 
graph is p^n'^ + o{n'^). The second property states that all pairs of vertex subsets S,T have edge 
density roughly p between them, apart from o(n^) edges. This fact, that the number of 4-cycles in a 
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graph can control the edge distribution, is quite notable. For our purposes, we will need to show that 
the first property implies the second property, with reasonable error estimates. The now standard 
proof bounds the second largest (in absolute value) eigenvalue of the adjacency matrix of the graph, 
and then applies the expander mixing lemma, which bounds the edge discrepancy between subsets in 
terms of the subset sizes and the second largest eigenvalue. 

Lemma 4.3 Suppose G = (y, E) is a graph with n vertices and average degree d, and the number of 
closed walks of length 4: in G is at most d^ + an^. For all vertex subsets S and T, 



n 



where A < a^'^n. 



Proof: Let A be the adjacency matrix of G, and Ai, A2, • • • , A.„ be the eigenvalues of A, with |Ai| > 
IA2I > • • • > |A„|. Let A = |A2|. It is easy to check that Ai > d. Let A = IA2I. The number of closed 
walks of length 4 in G is equal to the trace 

n 

Tr(A^) = j;Af>Af + A^ 

As Ai > d, and the number of closed walks of length 4 is at most d'^ + an'^, we conclude A < {an'^Y''^ = 
a^''^n. The expander mixing lemma (see Section 2.4 of |28j ) states that for all vertex subsets S,T, we 
have \e{S,T) ' ^ ' \ < X^\S\\T\. This completes the proof. □ 

A spanning subgraph of graph G is a subgraph of G on the same vertex set V as G. We let Hi be 
the spanning subgraph of G where vertices u,v £ V are adjacent in Hi if and only if there is an edge 
{X, Y) of Gj, and j G {1,2} with u G Xy, v E Y^. Note that the edge set of G is precisely the union 
of the edge sets of the Hi, although this is likely not an edge partition. We next use Lemma 14.31 to 
show that the edges of Hi are uniformly distributed. That is, the edge density in Hi is roughly the 
same between large vertex subsets of V . 

Lemma 4.4 Let \V\ = n. For each i, the graph Hi on vertex set V has the property that for all vertex 
subsets S and T , 

\eHAS.T) - ||5||r|| < 2mT"^%,n^J\Sm- 

Proof: Note that each edge {X, Y) of Gj gives rise to two complete bipartite graphs, between Xy 
and Y^ with j G {1,2}, in Hi. In particular, each such edge of Gj contributes 1^ degree in graph Hi 
to each vertex in X and in Y . 

We first give a lower bound on the average degree d in Hi. From the first property in Lemma I3.1H 
every vertex in Gj has degree differing from piTUi by at most m^ . Hence, every vertex in Hi has 
degree differing from Pimii^ = pin/2 by at most mj ■ ^^ = ^m- n. Thus, the average degree d 

of Hi satisfies \d — 2Pin\ < 2^^j n. 
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We next give an upper bound on the number VF4 of labeled closed walks of length four in Hi. By 
counting over the first and third vertex of the closed walk, we have W4 = X^^ j, \NHi{u,v)\'^, that is, 
W4 is the sum of the squares of the codegrees over all labeled pairs of vertices of Hi. By the first part 
of Lemma 13.111 if X, Y are distinct parts of partition Pi , then the codegree of X and Y in Gi is at 
most pfrrii + m^ . Hence, from Corollarv l2.11 if u and v are in different parts in the partition Pj, then 

\NhAu, v)\ <{\+ aT^"){pjm, + mf )£- = {\ + at"){v\ + m:'l')n. 

For each pair of vertices u, v in the same part of Pj, we have u and v have the same neighborhood in 
Hi and in this case we use the trivial estimate \Nii^{u^v)\ < n. In total, we get 



< (1 + 5mr^/'°) p^nVie, 

where we used Pi > rUi , cLi = 2^""^^ J with p = 2~ and rui > rrii > 2 . 



TTi,- )n I + rrii \ — | • n 
rui 



Let 



a = n-4 {W4 - (f) < n-^ (w^ - (1 - Am-^'%T^)p1n'^ /l&) < (smr^/^o ^ 4^-1/4^-A ^y^Q 



. -1/20 4 
< rUi ' Pi. 

By the choice of q, we have W^, = d'^ + an^. From Lemma |4.3( we have 

n 

Substituting in that the average degree d differs from pin/2 by at most ttIj n./2, the bounds a^' ^ < 
rrii Pi, i^i /2 < m^ pi, and \S\\T\ < nY^|5||r|, and using the triangle inequality, we have the 
desired estimate holds on the number e/f . (S, T) of edges in Hi between S and T. D 

We next prove the following lemma which estimates the edge density of G between certain vertex 
subsets. 

Lemma 4.5 LetX,Y G Pi he distinct with (X, 1") not an edge of G^ . If also {X,Y) is not an edge of 
Gi and A C X, B C Y, or if {X, Y) is an edge of Gi and there is j G {1, 2} such that A C Xy and 
B C Y^'\ then 



d«(A,B)-(i-n(i-f)) 

\ h>i J 






^/\Am' 



where n = \V\ is the number of vertices of G. 
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Proof: For i' > i, let di' denote the density between A and B of the pairs which are edges of at least 
one Hi with I < i'. In particular, by the choice of A and B, no edges of Hf^ for h < i go between A 
and B, and hence di = 0. Furthermore, we have dj+i = dHi^i{A,B). By Lemma 14.41 the number of 
edges between A and B in Hi^i satisfies 



CH^^AAB) 



Pi+i I ^11 ol ^ r,„.-l/80 



|^||i?| < 2mZ_Y>,+invWBl. 



(22) 



Let ti = 1 and for i' > i, let tj' = rij+i</i<j' (^ ~ ^)- ^^ prove by induction on i' that for each 
i' > i + 1, we have 



|di, -(1-iiOI <Q Yl i^+Ph), 

h=i+l 



(23) 



where 



„ _ 9^-1/80 



n 



^AMB\ 

In the base case i' = i + 1, we have the desired estimate ([25]) from dividing ([22]) out by |^||-B|. So 
suppose we have established (p3]) for i', and we next prove it for i' + 1, completing the proof of (p3|) 
by induction. 

Let X',Y' £ Pi, with X' C X,Y' C Y, and (X',y') not an edge of G'' . If {X',Y') is not an edge of 
Gi', letting A' = X' n A and B' = Y' n B, or if {X' , Y') is an edge of Gi>, and letting j e {1, 2} and 
A' = X'-^nA and S' = y^^~^' nB,we have 



eH,^,(^',i?' 



^^U'llS'l 



-1/80 



< 2m^,;';V+inV|A'||B'|. 



Each such pair X', y' with (X', y') not an edge of Gi' gives rise to a pair {A' , B'), and each such pair 
with (X' , Y') an edge of Gi' gives rise to two pairs {A' , B') of this form, one for each j G {1, 2}. The 
number of pairs {X' ,Y') is (l — dQii{X,Y)\ {rrii'/mi) . The total number A of such pairs {A',B') is 
therefore 

A = (1 - d^, (X, Y) + dc^, (X, Y)) [mvlrriif. 

On the other hand, the sum of |A'||i?'| over all such pairs is (1 — (ii')|^||i?|. Hence, the average value 
of l^'ll^'l over all such pairs {M ,B') is (1 - di')\A\\B\l I\. 

By the triangle inequality, summing over all such pairs ^', i?', we have the number of edges E of -ffj'+i 
between A and B which are not edges of any Hi with i <i' satisfies 



E-P^^il-d,')\A\\B\ 



< j; 2m,;if V+invi:4Wl 



A',B' 



< 2m,;;fV+in((l - d,)\A\\B\Y/^ll"^ 

< Am;,l{^%,'+in{\A\\B\)^/^mi'/mi, 

where we used Jensen's inequality for the concave function f{y) = y^'^. 
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Hence, 

Mi'+i - (1 - U'+i] 



dHA,B) + 



E 



A\\B\ 



(1 - U 



'+1. 



<\di>-{l-ti,)\ + 



E 



\A\\B\ 



{til - ti'+i) 



\de - {I - ti,)\ + 



E Pi'+i 



\A\\B\ 



Zi 



< (l + ^)|d,-(l-t,,)l + 



E Pi'+i 



< 



< 



1 + 



\A\\B 
-1/1 

i'+l 

i' 
Pi'+l\ TT n , „ \ I ^^-1/80 



(l-d,0 



(1 + ^M, - (1 - t,)\ + 4m;i/«V+in(|^||S|)-i/2^.V"^^ 



n (1 +pO +4m.,;fV+in(|A||Bri/2mi,/mi 



/i=i+l 
i'+l 

< q n (1+P/^)' 
/l=j+l 



which completes the induction proof of ([25 

As ^p/i < 1, we have J|(l +^5/1) < e. From (j23|) with i' = s — 1, we get 



dG(Ai?)- l-n(l-:P/^) 



/i>i 



n 



l<^^-i - (1 - is-i)\ <q Yl i'^+Ph) < 6m.^Y%i+i- 



which completes the proof. 



D 



The following lemma is the main result in this subsection, showing that q{A) — q{P') is small, where 
the mean square densities are with respect to the graph G. 

Lemma 4.6 For P' = Pr,hr-2, we have q{A) < q{P') + f . 



Proof: Consider the partition A' which is the common refinement of P' and A. The number of parts 
of A' is at most |P'||^| < |-P'P, and each part of P' is refined into at most \A\ < |P'| parts of A' . Let 
i be such that Pi = Pr^hr-2 = P' ■ As A' is a refinement of P', in Hj for each j < i between each pair 
of parts of A' the edge density is or 1. Noting that A! is a refinement of A, we have 



q{A) - q{P{) < q{^) - q{P{) = ^ 



m- 



E 



* ^ ■' \X\\Y\ 

x,YeP^ a,bcA',Acx,bgy ' " ' 



{d\A,B)-d\X,Y)). (24) 



Note that the summand in the above sum if {X, Y) is an edge of G' is as in this case d{A, B) = 
d{X,Y) = 1. We have d'^{A,B) - d'^{X,Y) < 1 for {X,Y) an edge of Gi, and the fraction of pairs 
(X, Y) which are edges of Gi is at most pi + m- . 

For a pair X,Y £ Pi with iX,Y) not an edge of G* or Gi, A,B C A' with A C X and B C Y, we 
have by Lemma 14.51 and the triangle inequality that 



\d{A, B) - d{X, y)| < 2 • 6m~Y%. 



n 



+1 Pi+i 



V\Mb\ 



(25) 
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Summing over all parts A, B of A! with A C X and B CY, we have 

J2 \A\\B\{d\A,B)-d\X,Y)) < Y. \A\\B\2\d{A,B)-d{X,Y)\ 

A,BgA' ,ACX,BCY A,BCA',ACX,BCY 

< Yl 2Am-^(^%i+iny^\A\\B\ 

A,BcA',AcX,BcY 

where the first inequality follows from a^ — 6^ = (a + 6)(a — h)< 2{a — h) for < a, 6 < 1, the second 
inequality is by ([25|) . and the last inequality is by using the Cauchy-Schwarz inequality, noting that 

Y \M\B\ = \X\\Y\ = {n/mif, 

A,BcA',AcX,BcY 

and the number of pairs A,BcA' satisfying A C X, B C Y is at most mf. 
Dividing out by |X||y| = (n/m,,)^, we have, 

E ]§§] (dHA, B) - d\X, Y)) < 24m-^Y' Vim?. (26) 

A,BcA',AcX,BcY ' " ' 

From the estimate (1261). we have from (1241) that 



q{A) - q{P') <p^ + m^ ^'^ + 24m,+Y ^+1"^? < Sp, < -, (27) 

where we used 

/ -1/10 r,30 -4 \ 

Pi = max(m. ,2r"e er), 
er < ei = /(I) = 2-10066, 

r>h,-2 = ^^-2>2^\~^ 



and hence wii > (6/e) . This completes the proof. 
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5 Induced graph removal lemma 

The induced graph removal lemma states that for any fixed graph H on h vertices and e > 0, there is 

6 = 6{e, H) > such that if a graph G on n vertices has at most 6n induced copies of H, then we can 
add or delete en? edges of G to obtain an induced H-bee graph. The main goal of this section is to 
prove Theorem 11.31 which gives a bound on 5~^ which is a tower in h of height polynomial in e-^. We 
in fact prove the key corollary of the strong regularity lemma. Lemma 11.21 with a tower-type bound. 
This is sufficient to prove the desired tower-type bound for the induced graph removal lemma. 

We first use the weak regularity lemma of Duke, Lefmann, and Rodl to find a large subset of a graph 
which is e-regular with itself. By iteratively pulling out such subsets and redistributing the set of 

38 



leftover vertices, we obtain a partition of any vertex subset into large vertex subsets each of which is 
e-regular with itself. Then, in Subsection 15.41 we establish Lemma 11.31 the strong cylinder regularity 
lemma, with a tower-type bound. We show in Subsection 15 . 51 that the strong cylinder regularity lemma 
implies the key corollary of the strong regularity lemma. Lemma 11.21 with a tower-type bound. This 
in turn implies Theorem 11.31 

In this section and the next, we call a pair (^4, B) of vertex subsets of a graph e-regular if for all A' C A 
and B' CB with \A'\ > e\A\ and \B'\ > e\B\, we have \d{A',B') - d{A,B)\ < e. 

5.1 The Duke-Lefmann-Rodl regularity lemma 

Given a A;-partite graph G = {V, E) with A;-partition y = T^ U . . . U V^, we will consider a partition fC 
of the cylinder Vi x • • • x V^ into cyhnders K = Wi x ■ ■ ■ x Wk, Wi C Vi ior i = 1, . . . , k, and we let 
Vi{K) = Wi. Recall from the introduction that a cylinder is e-regular if all the (g) pairs of subsets 
{Wi,Wj), 1 < i < j < k, are e-regular. The partition /C is e-regular if all but an e-fraction of the 
fc-tuples {vi, . . . , Vk) G Vi X • • • X Vfc are in e-regular cylinders in the partition K,. 

The weak regularity lemma of Duke, Lefmann, and Rodl [15] is now as follows. Note that, like the 
Frieze-Kannan weak regularity lemma, it has only a single-exponential bound on the number of parts, 
which is much better than the tower-type bound on the number of parts in Szemeredi's regularity 
lemma. Duke, Lefmann, and Rodl [15] used their regularity lemma to derive a fast approximation 
algorithm for the number of copies of a fixed graph in a large graph. 

Lemma 5.1 Let < e < 1/2 and (3 = /3(e) = e . Suppose G = iy,E) is a k-partite graph with 
k-partition V = ViL) . . .UVk- Then there exists an e-regular partition /C o/ Vi x • • • x V^, into at most 
/3~^ parts such that, for each K £ fC and 1 < i < k, we have \Vi{K)\ > /3|I^|. 

5.2 Finding an e-regular subset 

For a graph G = {V, E), a vertex subset U C V is e-regular if the pair ([/, U) is e-regular. The following 
lemma demonstrates that any graph contains a large vertex subset which is e-regular. 

Lemma 5.2 For each < e < 1/2, let 5 = 5{e) = 2~^ . Every graph G = {V, E) contains an 

e-regular vertex subset U with \U\ > 6\V\. 

Lemma 15.11 implies that each fc-partite graph G = {V, E) with /c-partition V = Vi U . . . U V^ has a 
cylinder K which is e-regular in which each part has size | Vi {K) | > e | V^ | . The proof can be easily 
modified to show that if each part of G has the same size, then each part of the e-regular cylinder 
K has equal size, which is at least e ^|^|- This implies that for any graph G = {V,E), if G has 
at least k vertices, by considering any k vertex disjoint subsets of equal size [|G|//i;J > \G\/{2k), and 
then applying this result, we get the following lemma. 



Lemma 5.3 For each < e < 1/2, any graph G = iV, E) on at least k vertices has an e-regular 

2fc* 



k- cylinder with parts of equal size, which is at least 2k^^ "" ^1^1- 
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The t-color Ramsey number rt{s) is the minimum k such that every t-coloring of the edges of the 
complete graph Kj. on k vertices contains a monochromatic chque of order s. A simple pigeonhole 
argument (see [23]) gives rt{s) < t^^ for t > 2. 

Lemma 5.4 For integers s,t > 2, let k = t*'^. Let G = iV^E) he a graph on at least k vertices, 
and < a < 1/2. The graph G contains an a-regular s-cylinder with parts of equal size at least 
N = 2kOi^ " 1^1 such that the density between each pair of parts differs by at most 1/t. 

Proof: Note that k = t^^ > rt{s). By Lemma 15.31 G contains an a-regular fc-cylinder Ui x ■ ■ ■ x U^ 
with parts of equal size at least A^. Partition the unit interval [0, 1] = /i U . . . U It into t intervals 
of length 1/t. Consider the edge-coloring of the complete graph on k vertices 1, . . . ,k for which the 
color of edge (i, j) is the number a for which the density d{Ui,Uj) G la- Since k > rt{s), there is 
a monochromatic clique of order s in this t-coloring, and the corresponding parts form the desired 
s-cylinder. D 

Lemma 5.5 Suppose a < 1/9 and Ui x ■ ■ ■ x Us is an a-regular cylinder in a graph G = {V, E) with 
s > 2a^^ parts Ui of equal size and the densities between the pairs of distinct parts lie in an interval 
of length at most a. Then the set U = UiU . . .UUs is e-regular with itself, where e = Sa^'"^. 

Proof: Let A,B C U with \A\, \B\ > e\U\ and, for 1 < i < s, let Ai = AnUi and Bi = B D Ui. 
Suppose d{Ui, Uj) E [7,7 + a] for 1 < -i < J < s. Let A^ be the union of all Ai for which \Ai\ > a\Ui\ 
and A^ = A\A^. Similarly, let B^ be the union of all Bi for which \Bi\ > a\Ui\ and B"^ = B\B^. We 
have l^^l < a\U\ < ae~^\A\ and \B'^\ < a\U\ < ae-^\B\. 

Let Ii denote the set of all pairs (i,z) with i £ [s], I2 the set of all pairs {i,j) G [s] x [s] with i / j, 
Ai C A^, and Bj C B^ and I3 = [s] x [s] \ {h U h). Let D{Ai, Bj) = \d{Ai, Bj) - 7] ^4jIM. We have 



\A\\B\ 



E ^-^ Ai Bi Bi C/j 1 

D(Ai,Bi) < > , .,, — - < max- — - < max- — - < — . 
^ ' - ^ \A\\B\ - i \B\ - t \B\ - se 

If (i,j) G /21 using the triangle inequality and a-regularity, 

\diA,Bj) - 7I < \d{A,Bj) - d{U,, Uj)\ + \d{Ui, f/j) - 7I < a + a = 2a. 



Hence, 



Y^ DiA,B,)< 5: 2a^<2a. 



Finally, 



J2 DiA,Bj)< Yl ^^<i_(^i_^j(^l_^j<l_(l_a6"i)2<2a6-^ 



{i,j)&h {i,j)&h 



40 



We have by the triangle inequahty 



\d{A,B)-^\ < J2 D{A,,B,) 

l<i,j<s 

= Y. D{A,B,)+ Y. D{A„B,)+ Y DiA,Bj 

(Jj)G-fi ii,j)&h {hj)&h 



1 



- se - 2 

By the triangle inequality, for any A,B,X,Y C U each of cardinality at least e\U\, we have 

\d{A, B) - d{X, Y)\ < \diA, B)-^\ + \j- diX, y)| < 1 + 1 = e. 

In particular, this holds for X = Y = U, and hence U is e-regular. D 

By applying Lemma [5.41 with a = (e/3)^ and s = t = [20"^^] , and then applying Lemma [5.5l we obtain 
Lemma |5.2[ Note that the proof assumes that the number of vertices of the graph is sufficiently large, 
at least A; = t**, but we can make this assumption as otherwise we can trivially pick U to consist of a 
single vertex, which is e-regular. 

The next lemma shows that if we have an e-regular pair, and add a small fraction of vertices to one 
part, then the pair is still regular, but with a slightly worse regularity. 

Lemma 5.6 Suppose A and B are vertex subsets of a graph G which form an e-regular pair, and C 
is a vertex subset disjoint from. B of size \C\ < /3\B\. Then the pair {A,B U C) is a-regular with 

a = e + ^/^ + l3. 

Proof: Let ^'C bands' UC'C 5UC with5' C B,C' C C, \A'\ > a\A\ and \B'uC'\ > a\BuC\. 
Note that \A'\ > a\A\ > €\A\ and \B'\ = \B' U C'\ - \C'\ > a\B U C\ - \C\ > (a - f3)\B\ > e\B\. Since 
the pair {A, B) is e-regular, we have 

\d{A',B')-d{A,B)\ <e. 
Also, \C'\ < \C\ < p\B\ < /3\B UC\< l3a~^\B' U C'\. Therefore, 
\d{A\B'uC')-d{A\B')\ -- 



''<-^'«''W!rk+^(-*'''''W?W-''<-^'«' 



We similarly have 

\d{A, BUC)- d{A, B)\ = \d{A, C) - d{A, B)\ J^ < /3. 

Hence, by the triangle inequality, we have |(i(j4', B' U C) — d{A, B [J C)\ is at most 

\d{A', B' U C) - d{A', B')\ + \d{A', B') - d{A, B)\ + \d{A, B) - d{A, BVJC)\ < I3a~^ + e + /3 

< a. 
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Hence, the pair {A, B L) C) is a-regular. D 
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By repeatedly pulling out 3e/4-regular sets using Lemma [5^ until there are at most ^|^| remaining 
vertices, distributing the remaining vertices among the parts, and using Lemma [5.61 twice in each part, 
we arrive at the following lemma. It shows how to partition a graph into large parts, each part being 
e-regular with itself. 

Lemma 5.7 For each < e < 1/2, let 6 = 5{e) = 2~'^ . Every graph G = {V^E) has a vertex 

partition V = Vi\J . . .UV^ such that for each i, 1 < i < k, we have \Vi\ > 6\V\ and Vi is an e-regular 
set. 

5.3 Tools 

In this subsection, we prove two simple lemmas concerning mean square density which will be useful 
in establishing and using the strong cylinder regularity lemma. 

The first lemma, which is rather standard, shows that for any vertex partition P, there is a vertex 
equipartition P' with a similar number of parts to P and mean square density not much smaller than 
the mean square density of P. It is useful in density increment arguments where at each stage one 
would like to work with an equipartition. 

Lemma 5.8 Let G = (V, E) he a graph, and P : V = Vi^J . . . yj Vk he a vertex partition into k parts. 
There is an equitahle partition P' ofV into t parts such that q{P') > q{P) — 2j. 

Proof: For an equipartition of V into t parts, we have a certain number of parts of order [|V^|/iJ 
and the remaining parts are of order [|y|/t]. For each part Vi £ P, partition it into parts of order 
[|y|/ij or [|V^|/i] so that there are not too many parts of either order to allow an equipartition of 
the whole set, with possibly one remaining set of cardinality less than |T^|/i. Let Q be this refinement 
of P. From the Cauchy-Schwarz inequality, it follows that q{Q) > q{P). Let U be the vertices in the 
remaining parts of Q, so \U\ < k\V\/t. 

Arbitrarily chop the vertices of U into parts of the desired orders so as to obtain an equipartition P'. 
We have 

q{P') > Yl d\X,Y)\j^ > q{Q) - ["l - ("l - MV^J > ^(Q) _ 2^ > q[P) - 2^. 

D 

The next lemma is helpful in deducing the induced graph removal lemma from the strong cylinder 
regularity lemma. Let G = (y, E) and P : V = V^i U . . . U Vfc be an equipartition, and /C be a 
partition of the cylinder Vi x • • • x V^ into cylinders. For K = Wi x • • • x Wk G /C, define the density 
^(^) = 'ji^|jx'''x|!4|' - The cylinder K is e-close to P if \d{Wi, Wj) - d{V, Vj)\ < e for ah but at most 
ek^ pairs 1 < i ^ j < k. if cylinder K is not e-close to P, then 

Y, \d{Wi,W,)-d{Vi,Vj)\>e^k\ 
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The cylinder partition /C is e-close to P if ^ d,{K) < e, where the sum is over all K ^ fC that are not 
e-close to P. Note that if /C is not e-close, then 

Y, Y, \d{Wi,Wj)-d{Vi,Vj)\d{K)>e^k\ 

KeK. l<if^j<k 

Recall that Q{IC) is the common refinement of all the parts Vi{K) with i £ [k] and K £ fC. 

Lemma 5.9 Let G = (V, E) and P :V = ViU . . .UVk he an equipartition with no empty parts, i.e., 
\V\ > k. Let IC be a partition of the cylinder Vi x ■ ■ ■ x Vk into cylinders. If Q = Q{IC) satisfies 
q{Q) < q{P) + e, then /C is 2^/'h^/^ -close to P. 



Proof: Let Qi denote the partition of Vi which is the restriction of partition Q to V^. 
Since P is an equipartition and \V\ > k, then all parts have order at least 



\v\ 

k 



> -2^. Therefore, 



e>q{Q)-q{P)= Y. ('?(Qi'Qi)-9(^i'^i))^^IP^4^ E (9(Q*'Qi)-^(^*'^i))' (28) 

l<i,j<k ' ' l<i¥'j<k 

where q{Qi,Qj) = Y^AeQ^BeQj '^^(^' B)paPb with p^ = |^ and pB = j^, and q(yi, Vj) = d'^{Vi, Vj). 
Fix for now 1 < i ^ j < k. For K = Wi x ■ ■ ■ x Wk G /C, we have 

d{Wi,Wj) = YdiAB' '^' '^' 



Wi\ \Wj\' 



and hence, by the triangle inequality, 

\d{W,,Wj) - diVi,Vj)\ <Y\d{A,B) - d{V,,V,)\^^ 



Wi\ \Wj\ 



where the sums are over all A £ Qi with A C Wi and B G Qj with B C Wj. Summing over all K £ }C, 
we have, 

Y \d{w,,Wj)-d{Vi,Vj)\d{K) < Y El^(^'^)-^(^-^i)lr^TM^W 

Y \d{A,B)-d{V,,V,)\pAPB 

< I Y {d{A,B)-d{Vi,Vj)fpAPB 
^AeQ„BeQj 

= {q{Qi,Qj)-q{V^,VJ))'/\ 

where the first equality follows by swapping the order of summation and the last inequality is the 
Cauchy-Schwarz inequality. 
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Summing over aW I < i ^ j < k and changing the order of summation, 

^1/2 

< [k' Yl QiQ^^QJ)-Qiyi,yj)' 

< Vfc2 . 4fc2e = 2el/2^^ 

where the second inequahty is the Cauchy-Schwarz inequahty and the last inequahty uses the estimate 
(j28p . By the remark before the lemma, we get that K, is (2e^") = 2^'^e'^'^-close to P. □ 

5.4 The strong cylinder regularity lemma 

Using the lemmas established in the previous subsections, in this subsection we prove Lemma ll.3|, the 
strong cylinder regularity lemma, with a tower-type bound. 

Recall that a /c-cylinder Wi x • • • x Wk is strongly e-regular if all pairs (VFj, Wj) with I < i,j < k are 
e-regular. A partition /C of Vi x • • • x Vfc into cylinders is strongly e-regular if all but e|Vi| x • • • x \Vk\ 
vertices {vi, . . . ,Vk) G Vi x • • • x V^ are contained in strongly e-regular cylinders K £ IC. 
We recall the statement of the strong cylinder regularity lemma. 

Lemma 5.10 For < e < 1/3, positive integer s, and decreasing function / : N — )■ (0, e], there is S = 
S{e, s, f) such that the following holds. For every graph G, there is an integer s < k < S, an equitable 
partition P : V = ViU . . .U V^ and a strongly f{k) -regular partition K, of the cylinder Vi x ■ ■ ■ xV^ 
into cylinders satisfying that the partition Q = Q{fC) of V has at most S parts and q{Q) < q{P) + £• 
Furthermore, there is an absolute constant c such that letting si = s and s^+i = t4^{{si/ f{si)y), we 
may take S = se with £ = 2e~^ + 1. 

Proof: We may assume \V\ > S, as otherwise we can let P and Q be the trivial partitions into 
singletons, and it is easy to see the lemma holds. We will define a sequence of partitions Pi,P2, ■ ■ ■ of 
equitable partitions, with Pj+i a refinement of Pj and g(Pj+i) > q{Pj) + e/2. Let Pi be an arbitrary 
equitable partition of V consisting of si = s parts. Suppose we have already found an equitable 
partition Pj : V = ViU .. .UVk with k < Sj. 

Let P{x,£) = X as in Lemma |5. II and 6{x) = 2~^ "" as in Lemma 15.71 We apply Lemma 15.71 

to each part Vi of the partition Pj to get a partition of each part Vi = Vn U . . . U Vi^. of P,, into parts 
each of cardinality at least 6\Vi\, where 5 = 5(7) and 7 = f{k) ■ (3 with /3 = (3{f{k), k), such that each 
part Vih is 7-regular. Note that 6~^ is at most triple-exponential in a polynomial in k/f{k). For each 
/c-tuple i = (£1, . . . ,ik) G [hi] X • • • X [hk], by Lemma [5TT] there is an /(A:)-regular partition fCi of the 
cylinder Vu^ x • • • x V^^. into at most f3~^ cylinders such that, for each K S fCi, \Vi£.{K)\ > /3\Vii^\. 
The union of the ICi forms a partition /C of Vi x • • • x V^ which is strongly f (k) -regular . 
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Recall that Q = Q{IC) is the partition of V which is the common refinement of all parts Vi{K) with 
i G [A;] and K ^ IC. The number of parts of /C is at most 5~^fi~^, and hence the number of parts of Q 
is at most k2^'^ "' . Thus, the number of parts of Q is at most quadruple-exponential in a polynomial 
in k/f{k). Let Pj+i be an equitable partition into 4e~^|Q| parts with q{Pj^i) > q{Q) — |, which exists 
by Lemma 15.81 Hence, there is an absolute constant c such that 

\p,+i\<u{{k/f{k)r)<s,+^. 

If q{Q) < q{Pj) + e, then we may take P = Pj and Q = Q{JC), and these partitions satisfy the desired 
properties. Otherwise, q{Pj+i) > q{Q) — f > q{Pj) + f 5 and we continue the sequence of partitions. 
Since q{Pi) > 0, and the mean square density goes up by more than e/2 at each step and is always at 
most 1, this process must stop within 2/e steps, and we obtain the desired partitions. □ 



5.5 A tower-type bound for the key corollary 

In the previous subsection, we established the strong cylinder regularity lemma with a tower-type 
bound. We next use this result to deduce a tower-type bound for Lemma 11.21 which is the key 
corollary of the strong regularity lemma, and easily implies the induced graph removal lemma as 
shown below. We recall the statement of Lemma 11.21 below. 

Lemma 5.11 For each < e < 1/3 and decreasing function / : N — )■ (0, e] there is 5' = 5'{e,f) such 
that every graph G = {V,E) with \V\ > 5'~^ has an equitable partition y = Vi U . . . U V^ and vertex 
subsets Wi C Vi such that \Wi\ > 6'\V\, each pair {Wi,Wj) with l<i<j<kis f{k)-regular, and 
all hut at most ek"^ pairs I < i < j < k satisfy {diVi, Vj) — d{Wi, Wj)\ < e. Furthermore, we m,ay take 
6' = g^, where S = (^, s, f) is defined as in Lemma [5.10\ and s = 2e~^ . 
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Proof: Let a = ^, s = 2e~ , and 6' = g^, where S = S{a,s,f) is as in Lemma FS-lUi We apply 
Lemma 15.101 with a in place of e. We get an equipartition P : V = ViL) . . .L)Vk with s < k < S and a 
strongly /(/c)-regular partition IC of Vi x ■ ■ ■ x V^ into cylinders such that the refinement Q = Q{IC) 
of P has at most S = S{a,s,f) parts and satisfies q{Q) < qiP) + a. Since \V\ > 5'~^ = 85"^, and 
P is an equipartition into k < S parts, the cardinality of each part Vi ^ P satisfies \Vi\ > y. By 
Lemma |5.9| as 2^'^a^'^ = e, the cylinder partition /C is e-close to P. Hence, at most an e-fraction of 
the fc-tuples {vi, . . . , Vk) & Vi x ■ ■ ■ x Vk belong to parts K = Wi x • • • x Wk of /C that are not e-close 
to P. Since <5(/C) has at most S parts, the fraction of fc-tuples (fi, . . . , Vk) G Vi x • • • x Vfc that belong 
to parts K = Wi x • • • x Wk of /C with \Wi\ < jg\Vi\ for at least one i G [k] is at most jg ■ S = | . 
Therefore, at least a fraction I — f{k) — e— ^ > of the fc-tuples {vi, . . . ,Vk) G Vi x ■ ■ ■ xVk belong to 
parts K = Wi X ■ ■ ■ X Wk of /C satisfying K is strongly /(A:)-regular, \Wi\ > jg\Vi\ > S'\V\ for i € [k], 
and K is e-close to P. Since a positive fraction of the A;-tuples belong to such K, there is at least 
one such K. This K has the desired properties. Indeed the number of pairs 1 < i ^ j < k for which 
|(i(Wj, Wj) — d(yi,Vj)\ > e is at most e/c^ and hence the number of pairs 1 < i < j < k for which 
\d{Wi, Wj) - d{Vi, Vj)\ > e is at most ek"^/! + k< ek"^. This completes the proof. □ 
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We next discuss how to obtain the induced graph removal lemma from Lemma 11.21 This is a bit 
easier to obtain than in [5] because Lemma [1.21 has the additional property that the subsets Wi in the 
cylinder K are e-regular. We finish this section by giving this proof and discussing the bound it gives 
for the induced graph removal lemma, which is a tower in h of height polynomial in e"^. We first 
need the following counting lemma, which is rather standard (see, e.g.. Lemma 3.2 in Alon, Fischer, 
Krivelevich, and Szegedy [S] for a minor variant). We omit the proof. 

Lemma 5.12 If H is a graph with vertices 1, . . . , /i, and G is a graph with not necessarily disjoint 
vertex subsets W\, . . . , W^ such that every pair {Wi, Wj) with l<i<j<his '^-regular with 7 < -rrff^ , 
\Wi\ ^ 7~^ for 1 < i < h and, for 1 < i < j < k, d{Wi,Wj) > r] if {i,j) is an edge of H and 

(h\ 

d{Wi,Wj) < 1 — r] otherwise, then G contains at least (f) |Wi| x ••• x \Wh\ induced copies of H 
with the copy of vertex i in Wi . 

We finish the section with a quantitative version of Theorem 11.31 

Theorem 5.1 There is an absolute constant c such that for any graph H on h vertices 1, . . . ,h and 
< e < 1/2 there is 5 > with 6~^ = tj{h) with j = 0(e~^) such that if a graph G on n vertices 
has at most 6n^ induced copies of H, then we can add or delete en?' edges of G to obtain an induced 
H-free graph. 

Proof: Let ry = |. Let 6 = [j) 6'^, where 5' = 6'{r],f) as in Lemma [5. Ill and f{k) = -^r]^ ■ If the 
number of vertices satisfies \V\ < S~^'^, then (5|y|'' < 1 and there are no induced copies oi H, in which 
case no edges of G need to be modified. We can therefore assume that \V\ > 6~^'^ = (|) 6'~^. As 
\V\ > 6'~^, we can apply Lemma 15.111 to graph G = {V,E) with i] in place of e and / as above. We 
obtain an equitable partition F = Vi U . . . U V^ and vertex subsets Wi C Vi such that \Wi\ > 6'\V\ > 
(I) , the cylinder Wi x • • • W^ is strongly /( A;) -regular, and all but at most r]k'^ pairs I < i < j < k 
satisfy \diWi,W,) - diV^,V,)\ < ??. 

The above counting lemma shows that if there is any mapping cj) : [h] ^ [k] such that (i(W^(j), VF^q)) > 
r] for (z,j) an edge of H and (i(W(^(i), W^^q)) < 1 — 77 for i,j distinct and nonadjacent in H, then G 

contains at least (5) |M^i| x • • • x \Wh\ > Sn^ induced copies of H. Hence, no such mapping cf) exists. 
That is, for every mapping : [/i] — )• [k], there is an edge {i,j) for which d{W^(^i-^, W^pf^j-^) < rj 01 distinct 
i,j that are nonadjacent in H with d(VF0(j), ^^^(j)) > I — rj. 

For each pair {Wi, Wj) for which d(Wi, Wj) < rj, delete all edges between Vi and Vj, and for each pair 
{Wi, Wj) for which d{Wi, Wj) > I — r], add all edges between Vi and Vj. Let G' be this modification 
of G. By the previous paragraph, there are no induced copies of H in G' . 

We have left to show that few edge modifications are made in obtaining G' from G. If a pair (Wi, Wj) 
for which edges were modified satisfies \d{Wi,Wj) — d{Vi,Vj)\ < rj, then the density between the two 
sets was only changed by at most 2rj. The number of 1 < i < j < A; for which |(i(Wi, Wj) — d{yi, Vj)\ > rj 
is at most •qk'^. Since Vi, . . . , V^ is an equipartition into nonempty parts, at most rjk'^ ' ^ (^) = Arjn^ 
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edges are changed between such pairs. In total at most 27^(2) +Ariv? < b-qn? < er? edges were changed 
in order to obtain G' from G. 

From Lemma 15.111 we have b' = g^, where S = S{a,s,f) is the function from Lemma 15.101 with 
a = ^, s = 2r]~^ and f{k) = 4^^^. From Lemma 15.101 S{a, s, /) will be at most a tower in h whose 
height is proportional to 77"^. Therefore, by the choice of t] and 6 in the above proof of the induced 
graph removal lemma, we indeed get the desired tower-type bound. This also completes the proof of 
Theorem Ol D 



6 Regular approximation lemma 

In this section we show how to derive the regular approximation lemma from Tao's regularity lemma, 
as discussed in Subsection II. 5[ The key lemma is Lemma 16.11 which shows how to turn a bipartite 
graph into a regular pair by changing some edges according to a weak regular partition. We use the 
notation x = y ziz e to denote the fact that y — e<x<y + e. 

It will be helpful to use the Hoeffding-Azuma inequality for concentration of measure. Say that a 
random variable -'^(i^) on an n-dimensional product space Vt = HILi ^i is Lipschitz if changing oj in 
any single coordinate affects the value of X{uj) by at most one. The Hoeffding-Azuma inequality (see, 
e.g., [To]) provides concentration for these distributions. 

Theorem 6.1 (Hoeffding-Azuma Inequality) Let X be a Lipschitz random variable on an n- 
dimensional product space. Then for any t > 0, 

P[|A:-E[A:1| >tl <2exp<^ 

[ 2n 

For a bipartite graph across parts A and B, and partitions A : A = Ai L) . . . L) Ar and B : B = 
BiU...UBs,let q{A, B) = d^{A, B) and q{A, B) = Y^ij ^^jjff c^^(^i; -^j) be the mean square density 
across the partitions A and B. 

Lemma 6.1 Let < (5 < 1. Suppose A,B are disjoint vertex subsets of a graph with \A\ > \B\ > 8(5~^ 
and d{A, B) = rj. Suppose further that A : A = AiU . . .U A^ and B : B = BiU . . . U Bs form a weak 
6-regular partition of the pair (A,B), i.e., for all S C A and T C B, we have 



J^ J] 1^, n S\\BjnT\d{A,Bj) - d{S,T)\S\\T\ 

i=l j=l 



<5\A\\B\. 

Then, one can add or remove at most i 6 + {q{A, B) — q{A, B)) ' J |j4||i?| edges across {A,B) and 
thus turn it into a 25^'^ -regular pair satisfying d{A, B) = rj ± 5. 

Proof: Let Uij = d{Ai, Bj) — d{A, B). If Oij > 0, we delete each of the edges connecting Ai and Bj 
independently with probability ^^rj^^-r- If ctij < 0, we add each of the nonedges between A^ and Bj 
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with probability ~ i_cI(a. b) ' Clearly the expected value of d{A, B) after these modifications is r]. By 
the Hoeffding-Azuma inequality, the probability that the new density deviates from r/ by more than 5 
is at most 

2exp {-^1^} = 2exp {-5M\\B\/2] < 1/4. 

Also, the expected number of edges changed is 

Y,Wi,Mi\\Bj\ = J^|(i(^„5j)-rf(A5)||^*||5il = |^||5|5^|rf(A„SJ)-d(^,S)|p^g,• 
< \A\\B\ r£^{d{A,,Bj)-d{A,B)fpiqA =\A\\B\{q{AB)-q{A,B)f'\ 

where pi = \Ai\/\A\ and qj = \Bj\/\B\ and in the inequality we used the Cauchy-Schwarz inequality. 
By the Hoeffding-Azuma inequality, the probability that the number of edges changed deviates by 
more than (5|A||i?| from its expected value is at most 

2exp|-MMl!| =2exp{-<^Vl|i?l/2} < 1/4. 

Consider now two subsets A' C A and B' C B. As {A,B) was initially weak (5-regular, the expected 
value of e{A' , B') differs from ?/|j4'||i3'| by at most (5|yl||i?|. By the Hoeffding-Azuma inequality, we get 
that the probability that e{A',B') deviates from its expected value by more than 5|A||i?| is at most 

2exp|-i|^M!| < 2exp{-5Vl|i3|/2} < 2exp{-4|^|} < 2-I^HB|-2^ 

where we use \A\ > \B\ > 85~^. As there are 2' I"*"' ' choices for {A' , B'), we get that with probability 
at least 3/4, all pairs {A', B') are within 2(5|A||i?| edges of having edge density ?/. To recap, we get that 
with probability at least 1/4 we made at most ( 5 + {q{A, B) — q{A, B)) ' j |j4| |i3| edge modifications, 
d{A,B) = 1] ziz 6 and all subsets A' C A, B' C B are within 2(5|j4||i?| edges from having edge density 
r/. Hence, there is such a choice for these edge modifications, and we claim that this implies that 
the pair {A, B) is 25^' ^-regular. Indeed, otherwise there would be A' C A, B' C B, with \A'\ > 
26^/^\A\, \B'\ >25^/^\B\, and \d{A',B')-d{A,B)\ > 25^3^ ^hich implies that A', S' differs by at least 
25V3|^'||^'| > (2(5^/^)^1^! |i?| = 8(5|A||i?| edges from having edge density d{A,B), a contradiction. 
This completes the proof. □ 

We next use Lemma l6.1l to deduce the regular approximation lemma from Tao's regularity lemma. 

Proof: Let (5 : N ^ (0, 1) be defined by 6{t) = min (s^,e/2) . Let eo = (e/2)2. Let To = To{5, eo, s) 



32p"' 

be the bound on the number of parts in Tao's regularity lemma and T = 1QTq/6{Tq)'^. If the number 
n of vertices of the graph G satisfies n < T, then we can partition G into parts of size one, and the 
desired conclusion is satisfied in this case. Hence, we may assume n > T. By Tao's regularity lemma, 
the graph G has an equitable vertex partition P into t > s parts and an equitable vertex refinement 
Q into at most Tq parts which is weak 6{t)-regu\&r such that q{Q) < q{P) + eo- 
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For each pair of parts {A, B) of partition P, let A and B denote the partitions of A and B given by 
partition Q. Since Q is a weak 5(t)-regular partition, and A and B have cardinahty at least [jj > ^, 
then the partitions A and B form a weak 4t^(5(i)-regular partition. Note that 4i^(5(t) < ^V-- 
Since |^|, \B\ > ^ > 8/(5(t)^, we may apply Lemma l6. II to the graph between A and B. That is, we 
may change at most [6{t) + {q{A,B) — q{A,B)) ' ) |j4||i?| edges across A and B and, in so doing, 



aitfV^'' 



make {A, B) a 5'(t)-regular pair, where we used that g{t) = 2 ( ^^y- j . In total, the number of edges 
we change to obtain a graph G' which is (7-regular with respect to partition P is at most 

Y, {m + {q{A,B) - q{A,B)fl^) \A\\B\ < (<5(t) + el^^^ < en'', 
A,B&P 

where we used Jensen's inequality for the concave function h{x) = x^'^, the inequality q{Q) < q{P)+eQ, 
and the bounds 5{t) < e/2, eo = (e/2)^. To complete the proof, we recall that the number of parts in 
partition P is at least s and at most Tq = Tq{5, eo, s). □ 



7 Frieze-Kannan weak regularity lemma 

In this section we prove Theorem 11.41 which provides a lower bound on the weak regularity lemma. 
For a vertex partition P : V = ViL) . . .UVk of a graph G = {V, E), let 

fp{A,B) = f^{A,B) = e{A,B)- J^ d{Vi,Vj)\AnVi\\B nVj\, 

l<i,j<k 

which is the difference between the number of edges between A and B and the expected number of 
edges based on the densities across the pairs of parts of the partition and the intersection sizes of A 
and B with the parts. We call a partition P of the vertex set of a graph G = {V, E) weak e-regular if 
it satisfies 

\fp{A,B)\<e\V\'' 

for all A,B^V. Recall that the weak regularity lemma states that for each e > there is a positive 
integer k{e) such that every graph has an equitable weak e-partition into at most k[e) parts. Moreover, 
one may take k{e) = 2'-^''^ ' . We will prove that the number of parts required in the weak regularity 
lemma satisfies k{e) = 2^*-'^ ', thus matching the upper bound. 

The following simple lemma of Frieze and Kannan (see Lemma 7(a) of |20] ) shows that the notion of 
weak regularity is robust. 

Lemma 7.1 // a partition is weak e-regular, then any refinement of it is weak 2e-regular. 

The robustness of weak regularity described by Lemma [7.1l is not shared by the usual notion of regular 
partition. For example, for any fixed e > and positive integer i, almost surely any partition into t 
parts of a uniform random graph on sufficiently many vertices is e-regular, while any partition of the 
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vertex set into parts of size 2 is not (e, 5, r/)-regular with e = 1, 6 = rj = 1/2. This is because almost 
surely in any such partition, between most pairs of parts of size 2, there wih be at least one edge and 
at least one nonedge. 

What we will actually prove is the stronger result that any weak e-regular partition must have 2^*^ ' 
parts, whether it is equitable or not. The corresponding regularity lemma, which is an immediate 
corollary of the usual weak regularity lemma, is the following. 

Lemma 7.2 For each e > there is a positive integer k*{e) such that every graph G = {V^E) has a 
vertex partition P with at most k*{e) parts which is weak e-regular. 

In the other direction, the equitable version of the weak regularity lemma also follows from Lemma [7.2[ 
This is because of the robustness property discussed in Lemma 17.11 above, that is, any refinement of 
a weak e-regular partition is a 2e-regular partition. By arbitrarily refining a not necessarily equitable 
partition into an equitable partition (except for a small fraction of vertices, which we distribute evenly 
amongst the other parts), we get an equitable weak 3e-partition whose number of parts is only a factor 
polynomial in e~^ larger. 

In order to prove the lower bound for weak regularity, we will need to perform some further reductions. 
We first state a bipartite variant which can easily be shown to be equivalent to Lemma 17.21 For a 
bipartite graph G = {U,V,E) with \U\ = \V\ = n, partitions Pi : [/ = C/i U . . . U [/^ and P2 : V" = 
Vi U . . . U Vfc', and vertex subsets A C U and B C V, let 

k k' 

fp,,pM,B) = f^,,p,{A'B) = e{A,B) - ^ ^d(C/„ v^OjAn c/i||s n y,|. 

i=l j=l 

We call the pair of partitions Pi,P2 weak e-regular with respect to the bipartite graph G if 

\fp,^P,{A,B)\<en' 
for aXlAcU and B (ZV. 

Lemma 7.3 For each e > there is a positive integer k'{e) such that every bipartite graph G = 
{U,V,E) with parts of equal size has partitions Pi of U and P2 of V each with at most k'{e) parts 
which form a weak e-regular partition. 

To prove Theorem 11.41 it suffices to show k'{e) = 2^'-^ '. Indeed, this follows from the bound 
k'{e) < k*{e/2). This inequality follows from first considering a single weak e/2-regular partition P 
for the bipartite graph G into at most k*(e/2) parts, and then refining it into a partition P' with at 
most 2/c*(e/2) parts based on the intersections of the parts of P with U and V. By Lemma |7.H P' is 
a weak e-regular partition. Letting Pi be the parts of P' in U and P2 be the parts of P' in V, the pair 
Pi,P2 form a weak e-regular partition, each with at most k*{e/2) parts, so that k'{e) < k*{e/2). 

To get a lower bound for the weak regularity lemma, we do not need to show the other direction 
of the equivalence between the weak regularity lemma and Lemma 17. 3| that Lemma 17.31 implies the 

50 



weak regularity lemma. However, this is rather simple, so we sketch it here. From a graph G we can 
consider the bipartite double cover of G, which is the tensor product of G with A'2. Applying Lemma 
17.31 we get a pair Pi,i-*2 of partitions of V{G) which form a weak e/2-regular partition with respect 
to the bipartite double cover of G. Refining the two partitions Pi, P2 of V{G), we get by Lemma |7. II 
a weak e-regular partition for G, thus establishing k*[e) < A;'(e/2)^. 

The following technical lemma will allow us to construct a weighted graph rather than a graph. A 
similar idea is present in the lower bound construction of Gowers [22] for Szemeredi's regularity lemma. 
Let VF be a [0, l]-valued n x n matrix. We view W as a. weighted graph with parts U and V, where 
U and V denote the set of columns and rows, respectively, of W. Let ewi^,B) = J2a€AbeB W{a,b) 
and dw {A, B) = '-^f^. 

Lemma 7.4 Let M be an n x n matrix with entries in the interval [0,1]. Let G = {U,V,E) be 
a bipartite random graph with \U\ = \V\ = n and edges chosen independently given by M and let 
9 = An~^''^. With probability at least 1 — e~^", we have \eM{A, B) — edA, B)\ < On^ for every pair of 
sets AcU, B CV. 

Proof: Given two vertices u £ U and v £ V, let a{u, v) be the random variable G{u, v) — M{u, v) 
(where G has been identified with its adjacency matrix). The mean of a{u,v) is zero for all u,v and 
the modulus of a{u,v) is at most 1. Hence, by the Hoeffding-Azuma inequality (Theorem 16. ip . given 
two sets A C U and B C V, the probability that 



Y^ aiu,v) 
{u,v)gAxB 



> On'^ 



is at most 2exp {-(6'n^)2/(2|^||P|)} < 2exp{-8n}. Summing over all A C U and B C V, the 
probability that there are subsets A C U and B C V with \eG{A,B) — eMiA,B)\ > 9n^ is at most 

22n . 2e-8n < g-4n_ □ 

For partitions Pi : U = UiU . . .UUk and P2 : V = ViU . . .UVk', let 

k k' 

fp,,pM,B) = ew{A,B) -YYdw{Ui,Vj)\U,nA\\Vj n B\. 

We say that partitions Pi , P2 form a weak e-regular partition ofWi(\ fp^,P2 {A, B) \ < ev? for all subsets 
^ C [/ and P C y. 

Corollary 7.1 Suppose W = {U,V,E) is an edge-weighted graph with weights in [0,1] and \U\ = 
\V\ = n. Let G = {U,V,E) be a bipartite random graph with \U\ = \V\ = n and edges chosen 
independently given by W and let 9 = 4n~^'^. With probability at least 1 — e~^", every pair of 
partitions Pi : U = Ui U . . . U Uk and P2 : y = Vi U . . . U V^' which form a weak e-partition for G also 
form a weak (e + 29)-regular partition for W . 
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Proof: By Lemma [7^ with probability at least 1 — e ^'^j we have |eG'(74, B) — ew{A, B)\ < 9v? for 
every pair of vertex subsets A C U and B C V. Suppose this indeed holds. For graph G, we have 

k k' 

/F,,p,(A i?) = eG(A i?) - J^ J^ dG(^i, v:,-)!^ n [/ills n v,-|. 

The first term is within 6n? of ew{A, B). The second term is the average of edA', B') over all subsets 
A' C U and B' C V with \A' D Ui\ = \A D Ui\ for all i and \B' D Vj\ = \B D Vj\ for all j, and 
hence is within Oii'^ of the corresponding average of ew{A',B') over all of the same pairs {A',B'). 
By the triangle inequality, for W, we get that for all A C U and B C V, we have \fp p {A, B)\ < 
\f^^p^{A,B)\ + 2e-n? < {e + 29)Ti?. Hence, Pi,P2 also form a weak (e + 26')-regular partition for W. □ 

From Corollary 17.11 and the previous remarks, to obtain the desired lower bound in Theorem 11.41 on 
the number of parts in the weak regularity lemma it suffices to prove a lower bound of the form 2^'*^ > 
on the number of parts fco(e) in the following weak regularity lemma for weighted bipartite graphs. 

Lemma 7.5 For each e > there is a positive integer kQ{e) such that every edge-weighted bipartite 
graph G = {U, V, E) with weights in [0, 1] and parts of equal size has partitions Pi of U and P2 of V 
each with at most A;o(e) parts which form a weak e-regular partition. 

Lemma 17.51 is also known as the weak matrix regularity lemma. This is because it provides, for any 
nx n matrix with entries in [0, 1], partitions of the rows and columns into a bounded number of parts, 
such that the sum of the entries in any submatrix (which is the product of a set of rows and columns) 
is within e?i^ of what is expected based on the intersections with the parts and the density between 
the parts. 

Our goal is to construct a bipartite graph G with edge weights in [0, 1] which provides a lower bound 
of the form A;o(e) = 2 '"^ '. Suppose < e < 2"^*^. Consider the following weighted bipartite graph G. 
The graph has parts U and V each of order n = 2^ "^ . Let r = 2~'^^e~^ and a = 2^^e. Consider, for 
1 < i < r, equitable partitions U = UqUUI and V = Vq L) V^ picked uniformly and independently at 
random. For vertices u £ U and v £ V, let s(n, v) be the number of z E [r] for which there is j G {0, 1} 
such that u G C/| and v G VL and t{u,v) be the number of i G [r] for which there is j G {0, 1} such 
that u G Uj and v G Vj* •, so that s{u,v) + t{u,v) = r. Let W{u,v) = ^ + {s{u,v) — t{u,v))a. We 
define the weight w{u,v) between u and v as follows. If < W{u,v) < 1, then w{u,v) = W{u,v), if 
W{u, v) < 0, then w{u, v) = 0, and if W{u, v) > 1, then w{u, v) = 1. 

Call a pair (u,v) £ U xV extreme if |VF(u, v) — 1/2| > 1/4, and a vertex u £ U nice if it is in at most 
n/8 pairs {u, v) with v £ V which are extreme. 

Lemma 7.6 With probability at least 3/4, all but at most e~^^^n vertices of U are nice. 

Proof: Fix a pair (n, v) £ U xV. The event (u, v) is extreme is the same as \s{u, v) — t{u, v)\a > 1/4, 
or equivalently that \s{u, v) — r/2\ > g^. The number s{u, v) is a sum of r independent variables, with 
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values or 1 each with probabihty 1/2, and hence follows a binomial distribution with mean r/2. By 
Chernoff's bound 1^, the probability that \s{u,v) - r/2\ > ^ is less than 2e~2(i/(8°))'A = 2e-^\ 
Hence, by linearity of expectation, the expected number of extreme pairs {u,v) £ U x V is less than 
2e~^ n^. Therefore, by Markov's inequality, the probability that there are at least 8e~^ n? extreme 
pairs (n, v) is less than 1/4. Since any nice vertex is contained in at most n/8 extreme pairs, we see 
that with probability at least 3/4, all but at most 64e~^ n < e~^^^n vertices in U are nice. □ 

For h £ [r], we let Sh{u,v) denote the number of i € [r] \ {h} for which there is j € {0, 1} such that 
u G U^A and v £ VJ, and th{u, v) be the number of i € [r] \ {h} for which there is j G {0, 1} such that 
u G Uj and v G ^1-7) so that Sh{u,v) + th{u,v) = r — 1. Let Wh{u,v) = 5 + {sh.{u,v) — tfi{u,v))a. 
As above, we define the weight Whiu,v) by Wh{u,v) = Wh{u,v) if < Wh{u,v) < 1, Wh{u,v) = if 
Wh{u, v) < 0, and Wh{u, u ) = 1 if Wh{u, v) > 1. 

Lemma 7.7 Suppose u G U^ and we are given \wh{u, z;) — 1/2| < 1/2— a for at least Xn vertices v £V , 
and we do not yet know the partition V = V^^JV^^ . Then the probability that d^{u, V^) — dyj{u, V^_-) > 
a/2 is at least 1 — ^ . 

Proof: Consider the event E that 

^ Wh{u,v) - ^ Wh{u,v) > an/A. 

Note that the expected value of the left hand side is 0. Recall that a hypergeometric distribution is at 
least as concentrated as the sum of independent random variables with the same values (for a proof, 
see Section 6 of [23). By the Hoeffding-Azuma inequality (Theorem 16. ip . the probability of event E 
is at most 



r (an/8)2l , 7 „ , 1 



where in the last inequality we use 0<e<2 ^'^, r = 2 '^^e ^, n = 2^ ''^ , and a = 2^^e. 
For a fixed u, if \wh{u, u) — 1| < ^— a, then w{u, v) = Wh{u, v)+a if v is in V^ and w{u, v) = Wh{u, v) — a 
if V is in Vi_j. For all v, w{u,v) is within a of Wh{u,v). Therefore, letting w{u,S) = J2ses'^('^^^)^ 
we see, since \wh{u,v) — ^| < | — a for at least |n vertices of v, that 

w{u, y/') - w{u, Vi_j) > -an - -an + Wh{u, Vj") - Wh{u, Vi_j) > -an - -an = -n. 
The result follows. □ 

Call a nice vertex u £ U very nice if for each h £ [r] and j £ {0, 1} with u £ [/■ , 

d{u, Vj") - d{u, Vllj) > a/2. 

Corollary 7.2 With probability at least 3/4, every nice vertex u is very nice. 
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Proof: Given u is nice, then for each /i G [r], we must have |W/i(n,f) — 1/2| < 1/2 — a for all but at 
most |n vertices v ^ V . The probabihty that there is a vertex which is nice but not very nice is by 
Lemma 17.71 at most rn ■ jp^ < 1/4, which completes the proof. □ 

From Lemma 17.61 and Corollary 17. 2[ we have the following corollary. 

Corollary 7.3 With probability at least 1/2, the graph G has the following properties. 

• The number of vertices in U which are not nice is at most e~^^^n. 

• Every nice vertex is very nice. 

Consider the random bipartite graph B = B(n, r) with vertex parts U and [r] where i G [r] is adjacent 
to u G U ii u £ Uq. By Lemma [2TT] with /z = 1/4, as r > 32 log n, we have the following proposition. 

Corollary 7.4 With probability at least 3/4, for each pair u, u' G U , the number of i for which u and 
u' both belong to U^ for some j G {0, 1} is less than |r. 

Hence, with probability at least 1/4, the graph G satisfies the properties in Corollaries 17.31 and 17.41 
Fix such a graph G. 

Theorem 7.1 No partitions Pi:UiU...UUkofU and P2 : ViU . . .UVk' ofV with k < n/2 form a 
weak e-regular partition. As n/2 > 2^ ^ , we therefore have fco(e) > 2^ "^ for < e < 2~^^ . 

Theorem 17.11 gives a lower bound on the number A;o(e) of parts for the weak matrix regularity lemma 
(Lemma l7.5p with approximation e. Before we prove this theorem, we remark that it has no restriction 
on the number of parts of partition P2, and further shows that Pi has to be almost the finest partition 
(partition into singletons) to obtain a pair of partitions which are weak e-regular. 

Proof: Suppose for contradiction that the partitions Pi and P2 are weak e-regular. That is, 
Ifp^^P^iA, B)\ < en^ for all subsets A(ZU and B CV. 

Fix for now Ut with \Ut\ > 2. Call the pair (i,t) G [r] x [k] useful if \Ut n C/j| > |C/t|/32 for j G {0, 1}. 
Let Mt be the number of i G [r] for which the pair (i,t) is useful. The sum 

St = Y, \Ut n U^WUt n Ui\ < r|i7i|V32 + Mt\Ut\''/A 

iG[r] 

is precisely the number of triples u,u',i with u,u' distinct elements of Ut and i G [r] for which u and 
u' are not in the same set in the partition U = UqL) U'l- By Corollary 17. 4^ the sum St is at least 
ir(l^*l) > |C/t|2r/16. Hence, 

r\Ut\^ /32 + MtlUtl^A >St> \Ut\\/W. 

We thus have Mt > r/8. 
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Since Mt > r/8 for each t for which \Ut\ > 2 and there are at most k parts in partition Pi of order 1, 
there is an i for which partition i satisfies that at least (n — k)/8 > n/16 vertices u £ U are in Ut with 
the pair {i,t) useful. Fix such an i. For each t for which {i,t) is useful and all but at most |C/i|/64 
vertices in Ut are nice, for j G {0, 1}, let Utj be a subset of Ut n [/* of cardinality exactly [|[/i|/64], 
and Aj denote the union of all such Utj ■ Recall from Corollary 17.31 that there are at most e 



-100 



n 



vertices in U which are not nice. Hence, there are at most 64 • e~^^^n vertices in U which belong to 
a Ut for which the pair (i,t) is useful but there are at least |f7t|/64 vertices in Ut which are not nice. 
Thus, the number of vertices in U which belong to a Ut for which (i, t) is useful and there are at most 
|C/i|/64 vertices in Ut which are not nice is at least 



--64e-i°0n>-. 
16 32 



We thus have |^o| = l^il > §/64 = 2-'^^n. 

Note that, by construction, we have for each t £ [k], i £ [k'] and T CV, 

\AonUt\\TnVe\d{Ut,Ve) = \AinUt\\TnVe\d{Ut,Vi). 

Thus, if the partitions Pi, P2 form a weak e-regular partition, we would have to have 

\e{Ao,VJ)-e{Ai,VJ)\<2en\ (29) 

for j E {0, 1}. However, as each u £ Aq is in Uq and is very nice, we have 

diAo,V^)-d{Ao,Vl)>a/2. 

Since |^o| > 2~^^n and \Vj\ = ?i/2 for j G {0, 1}, we have 

e{Ao,V^) - e{Ao,Vl) > 2~'^an^. 

Similarly, 

e{AuVi) - e{Ai,V^) > 2~^'^ar?. 

Adding the previous two inequalities, we have 

e(Ao, Vi) - e{Au V^) + e{Ai,Vl) - e{A^, Vl) > 2-^^an\ (30) 

But, by (I29p for j G {0, 1}, the left hand side of (130p is at most Aev? in modulus, contradicting the 
above inequality and a = 2^^e. This completes the proof. □ 

Remark: While Theorem 17.11 provides for each e only one graph which requires at least 2 '"^ ' parts 
in any weak e-regular pair of partitions, it is a simple exercise to modify the proof to show that all 
blow-ups of G also satisfy this property, thus obtaining an infinite family of such graphs. For a graph 
G on n vertices and a positive integer t, the blow-up G{t) of G is the graph on nt vertices obtained 
by replacing each vertex u by an independent set lu, and a vertex in /„ is adjacent to a vertex in ly 
if and only if u and v are adjacent. 
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8 Concluding remarks 

• Weak regularity lemmas without irregular pairs 

While proving his famous theorem on arithmetic progressions in dense subsets of the integers, 
Szemeredi |39] actually developed a regularity lemma which is weaker than what is now com- 
monly known as Szemeredi's regularity lemma [50]. The following version is a strengthening 
of the original version, as it guarantees that all pairs, instead of all but an e-fraction of pairs, 
under consideration are e-regular. The key extra ingredient is an application of Lemma 15.61 to 
redistribute the small fraction of vertices which are not in regular sets. 

Lemma 8.1 For each < e < 1/2 there are integers k = k[e) and K = K{e) such that the 
following holds. For every graph G = iV^E), there is an equitable partition V = Vi U . . . U Vt 
into at most k parts such that for each i, 1 < i < t, there is a partition V = Vn U . . . U V^j. , with 
ji ^ K, such that for all 1 < i < t and 1 < j < ji the pair (V^, V^j) is e-regular. Furthermore, 
k{e) = 2^ and K{e) = 0{e~^), where C is an absolute constant. 



Szemeredi |39] originally gave a triple exponential upper bound on the number of parts in the 
original regularity lemma, whereas it is now known (see [33]) that the correct bound is single 
exponential. Through iterative applications, the original regularity lemma was used by Ruzsa 
and Szemeredi [3S] to resolve the (6, 3)-problem, and by Szemeredi [35] to establish the upper 
bound on the Ramsey- Turan problem for K4. It is a relatively simple exercise to show that 
Szemeredi's original regularity lemma implies the Prieze-Kannan weak regularity lemma, but 
with a bound that is one exponential worse than the tight bound established in the previous 
section. This can be accomplished by showing that the common refinement of the partitions in 
the original regularity lemma satisfies the Frieze-Kannan weak regularity lemma. 

There are a number of notable properties of Lemma |8.1[ First, all pairs (Vi, Vij) under consid- 
eration in Lemma |8. II are regular. In contrast, Theorem 11.11 shows that we must allow for many 
irregular pairs in Szemeredi's regularity lemma. Second, the bounds are much better than in 
Szemeredi's regularity lemma. The bounds on the number of parts is only single-exponential, 
instead of the tower-type bound which appears in the standard regularity lemma. Furthermore, 
each of the partitions of V have at most K{e) = 0(e~^) parts. Indeed, this can be established 
by first proving any bound on K{e), and then using the following additive property of regularity 
to combine parts. Namely, applying Lemma [8. II with any bound on K{e) and with e^/4 in place 
of e, and, for each i, partitioning V into 0{e~^) parts, each part consisting of the union of parts 
Vij for which diVi, Vij) lies in an interval of length at most e/2, the following lemma shows that 
Vi together with each part of the new partition forms an e-regular pair. 

Lemma 8.2 Let < e < 1 and a = e^/4. Suppose A,Bi,...,Br are disjoint sets satisfying 
{A,Bi) is a-regular for 1 <i <r and \d{A,Bi) — d{A,Bj)\ < e/2 for 1 < i,j < r. Then, letting 
B = BiL) . . .L) Br, the pair (A, B) is e-regular. 
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To save space, we omit the details of how to prove Lemmas 18.11 and 18.21 

Another interesting consequence of Lemma 18.11 is that it imphes that every graph on n vertices 
has an e-regular pair in which one part is of size Q{en) and the other part has size 2~^ n. 
It is well known (see [27]), that one can find an e-regular pair in which both parts have size 
2"*^ n. In some applications, having one regular pair is sufficient (see, e.g., [16], [23], |27|). 
and one obtains much better bounds than by applying Szemeredi's regularity lemma. In the 
other direction, there are graphs (see Theorem 1.4 in [31] for a tight result) for which any e- 
regular pair has a part of size 2"*^ n. We can nevertheless guarantee that one part is of size 
r2(en). It seems likely that having such a large part in a regular pair could be useful in some 
applications. 

By iterative application of Lemma 18.11 one can also obtain a version of the Duke-Lefmann-Rodl 
lemma such that all cylinders in the partition are e-regular. In fact, using Lemma 15.71 one can 
further guarantee that all cylinders in the partition are strongly e-regular, and the bound is still 
of constant-tower height. 

A part irregular with no other parts 

In the proof of Theorem ll.lt we found a graph G such that for any partition into k parts there are 
at least Ok parts Vi for which there are at least O^^rjk pairs (T^, Vj) which are not (e, (5)-regular, 
where < < 1 is a fixed constant. Is it possible to improve this result so that all parts are 
in r]k irregular pairs? The answer is no, as Lemma 18.11 implies that any graph has an equitable 
partition into only 2*^ parts in which one part is e-regular with all the other parts. 

A single regular subset 

It would be interesting to determine tight bounds for the size of the largest e-regular subset 
which can be found in every graph. In Lemma [5. 2 1 we showed that every graph G = (V, E) must 
contain an e-regular subset U of size at least 2~'^ \V\. On the other hand, a result of Peng, 

Rodl and Rucihski [31] implies that there are graphs G = {V, E) which contain no e-regular 
subset of size e'^'^ \V\. We conjecture that the correct bound is single exponential, though the 
power may be polynomial in e~^. 

Equitable partitions and not necessarily equitable partitions 

In the regularity lemmas considered in this paper, we often assume the partitions we consider 
are equitable partitions. It is not difficult to see that this assumption is non-essential and the 
bounds do not change much. Indeed, consider for example the following variant of Szemeredi's 
regularity lemma. 

Lemma 8.3 For each e,6,r] > there is a positive integer M = M(e, 6, rj) for which the following 
holds. Every graph G = {V, E) has a vertex partition 1/ = Vi U . . . U T4 with k < M such that 
the sum of \Vi\\Vj\ over all pairs (V^, Vj) which are not {e,S)-regular is at most r/|yp. 
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The key difference between this version of Szemeredi's regularity lemma and the usual statement 
is that the parts of the partition are not necessarily of equal order, and we measure the regularity 
of the partition by the sum of the products of the sizes of the pairs of parts that are (e, (5)-regular. 
Szemeredi's regularity lemma clearly implies Lemma [8.3l as it further specifies that the partition 
is an equitable partition, and the condition that the sum of | Vi| |V^| over all pairs {Vi,Vj) which are 
not (e, (5)-regular is at most 7?|V^P is then essentially the same as saying that the number of pairs 
(YijVj) which are not (e, 5)-regular is at most ryA;^. To see that Lemma 18.31 implies Szemeredi's 
regularity lemma with similar bounds, first apply Lemma fB.Sl and then randomly refine each part 
Vi = VioUViiU. . .UVij. into parts of size m = j^jja^iyi/M, where a = min(e, 5, 77), except possibly 
ViQ, which can have size less than m as not necessarily each Vi has cardinality a multiple of m. 
The total number of remaining vertices, those in Vq = Ui=i ^o> is less than km < j^a^lV], as 
there are less than m remaining vertices from each of the k parts Vi. Redistributing the vertices 
of Vo evenly among the parts of size m, we get a new equitable partition where each part has size 
between m and at most (1 + ^^0^)771. It is not hard to show using an additional application of 
the Frieze-Kannan weak regularity lemma, that because we randomly refined each part, almost 
surely for all pairs {Vh,Vi) which are (e, (5)-regular, all pairs {Vha,Vij) are (2e, 2(5)-regular. That 
is, the regularity between pairs of parts is almost surely inherited between large vertex subsets. 
It then easily follows that the equitable partition is similar both in the number of parts and the 
degree of regularity to the original partition from Lemma [ 



Because of this equivalence, we get similar lower bounds for regularity lemmas where the par- 
titions are not necessarily equitable partitions. For example, for Lemma 18.31 for some absolute 
constants e,6 > 0, we get a bound on M{e,6,ri) which is a tower of 2s of height Q{r]~^). Simi- 
larly, in Theorem 11.21 and Corollary 11.11 giving lower bounds on the strong regularity lemma, the 
assumption that B is an equitable partition is not needed. 

Finally, as we have already noted in Section [71 it is much easier to show that for the Frieze- 
Kannan weak regularity lemma we do not need to assume that the partition is equitable. This 
is a simple consequence of the robustness of weak regularity under refinement. 

Irregular pairs and half graphs 

A (generalized) half-graph has vertex set Au B with 2n vertices A = {oi, . . . ,««} and B = 
{61, . . . , bn}, in which (oj, bj) is an edge if and only ii i < j (but the edges within A and B 
could be arbitrary). As mentioned in the introduction, any partition of a large half-graph into 
a constant number of parts has some irregular pairs. Malliaris and Shelah [30] recently showed 
that for each fixed k, every graph on n vertices with no induced subgraph which is a half-graph 
on 2k vertices has an e-regular partition with no irregular parts and the number of parts is at 
most e~'^'', where c^ is single-exponential in k. This shows that any construction forcing irregular 
pairs in the regularity lemma, like that given in the proof of Theorem 11.11 must contain large 
half-graphs, of size double-logarithmic in the number of vertices. 

Distinct regular approximations 
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The notion of /-regularity, which appears in the regular approximation lemma, has since ap- 
peared elsewhere. Alon, Shapira, and Stav [U] investigate the question of determining if a graph 
G = {V, E) can have distinct regular partitions. Two equitable partitions U : U = UiL) . . .L)Uk 
and V : V = Vi U . . . U Vk into k parts are said to be e-isomorphic if there is a permutation 
TT : [A;] — )• [k] such that for all but at most 6(2) pairs 1 < i < j < k, \d{Ui, C/j) — d(V^(j), K-(j))| < e. 

They prove that for some f{k) = Q ( "^^^3 1 and infinitely many fc, and for every n > n[k), 
there is a graph on n vertices with two /-regular partitions which are not 1/4-isomorphic. On 
the other hand, they show that if f{k) < min(l//c^, e/2), then any two equitable partitions of G 
into k parts which are each /-regular must be e-isomorphic. 

• Multicolor and directed regularity and removal lemmas 

The proof of Szemeredi's regularity lemma has been extended to give multicolor (see [27j) and 
directed (see [7j) extensions. These imply multicolor and directed generalizations of the graph 
removal lemma. As discussed in [18], the new proof of the graph removal lemma with a log- 
arithmic tower height extends with a similar bound to these versions as well. Axenovich and 
Martin [11] recently extended the strong regularity lemma in a similar fashion to give multicolor 
and directed versions, in order to establish extensions of the induced graph removal lemma. Our 
proof of the induced graph removal lemma with a tower-type bound similarly extends to give 
multicolor and directed versions. 

• On proofs of regularity lemmas 

As noted by Gowers, the constructions in [22] not only show that the bound in Szemeredi's 
regularity lemma is necessarily large, but that, in some sense, the proof is necessary. Any proof 
must follow a long sequence of successively finer partitions, each exponentially larger than the 
previous one. While this notion is hard to make precise, it should be clear to anyone who has 
studied the proof of the regularity lemma and the lower bound construction of Gowers. Theorem 
ll.ll adds further weight to this conviction. Furthermore, the proof of Theorem 1 1.2 1 shows that any 
proof of the strong regularity lemma requires a long sequence of partitions, each of tower- type 
larger than the previous partition. That is, the iterated use of Szemeredi's regularity lemma is 
required in any proof of the strong regularity lemma. 

Ackno'wledgment. We would like to thank Noga Alon for helpful comments. 

Note added. After this paper was written we learned that a variant of Theorem 11.21 was also proved, 
independently and simultaneously, by Kalyanasundaram and Shapira. In the situation of Corollary 
11.21 their theorem gives a lower bound of wowzer-type in yloge^ for the strong regularity lemma. 
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